Entering edit mode
3.1 years ago
NB
▴
960
Hi, Is there a way to extract or calculate the total number of overlapping bases from a CRAM file for paired-end WGS ?
If I get the concept right, it can also be calculated by creating a fragment length count file and read length ?
Thank you
Picard tools understand CRAM format natively so you can simply use
CollectWGSMetrics
. If not CRAM can be converted to BAM on fly and then fed into Picard.Thanks.. is there any explanation on how Picard calculates the metrics - does it include/exlude dups, soft clipped reads, etc ?
Do you mean the total number of aligned reads in the file or the total number of overlapping bases across an interval? Have you looked at mosdepth?
I meant total number of overlapping bases across the whole (human) genome.. does mosdepth output the overlapping bases ? I did not see that output/option on its git repo