How do I summarize coverage across many genomes to inform filtering cutoffs?
4.8 years ago
selplat21 ▴ 20

Hello,

I used samtools depth to get a text file of per-position coverage across the genome for each of ~600 samples. To inform coverage cutoffs in the next step, I'd like to make a summary or histogram that describes coverage for all of these genomes together.

For example, I'd eventually like to filter out reads below x coverage and above y coverage. The cutoffs would be based on the distribution of coverage across all samples, to remove likely duplicates and similar artifacts.

Any help is appreciated!

alignment coverage samtools genome next-gen • 1.2k views
4.8 years ago
onestop_data ▴ 330

Please check out the thread "Filter Bam File Based On Coverage", and also take a look at the new samtools subcommand named coverage (samtools coverage). Please make sure you have the latest samtools.
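If you want to work directly from the samtools depth files you already have, one possible approach is to pool the per-position depths from all samples into a single histogram and read candidate cutoffs off its percentiles. The sketch below assumes each file holds the single-BAM output of samtools depth (three whitespace-separated columns: reference name, position, depth). The `depth/*.depth.txt` path pattern, the function names, and the 2.5/97.5 percentiles are placeholders chosen for illustration, not recommendations. Note that samtools depth omits zero-coverage positions unless run with -a, which shifts the lower tail.

```python
from collections import Counter
import glob


def depth_histogram(lines, hist=None):
    """Add the depth column of samtools depth lines to a Counter (depth -> sites)."""
    hist = Counter() if hist is None else hist
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        hist[int(fields[2])] += 1
    return hist


def percentile_from_histogram(hist, q):
    """Smallest depth d such that at least q percent of sites have depth <= d."""
    total = sum(hist.values())
    if total == 0:
        raise ValueError("empty histogram")
    target = total * q / 100.0
    running = 0
    for depth in sorted(hist):
        running += hist[depth]
        if running >= target:
            return depth
    return max(hist)


if __name__ == "__main__":
    pooled = Counter()
    # Hypothetical layout: one depth file per sample in ./depth/
    for path in glob.glob("depth/*.depth.txt"):
        with open(path) as handle:
            depth_histogram(handle, pooled)
    if pooled:
        low = percentile_from_histogram(pooled, 2.5)
        high = percentile_from_histogram(pooled, 97.5)
        print(f"candidate cutoffs: min depth {low}, max depth {high}")
```

Keeping only a depth-to-count histogram, rather than every position, means memory use stays small even with ~600 genomes. You could also build one histogram per sample first, to spot outlier samples before pooling.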


Do you mean samtools stats --coverage? I only see this as a subcommand in the new samtools. This is very helpful, thank you!


No, I mean samtools coverage. It was introduced here (samtools 1.10).
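To compare samples using that output, a rough sketch like the following could collapse one samtools coverage table into a single length-weighted mean depth per sample. It assumes the default tab-separated output with the header line starting with `#rname` and the `startpos`, `endpos`, and `meandepth` columns. The function name is made up for this example.

```python
import csv


def sample_mean_depth(lines):
    """Length-weighted mean depth over all sequences in one samtools coverage table."""
    header = None
    total_length = 0
    weighted_sum = 0.0
    for row in csv.reader(lines, delimiter="\t"):
        if not row:
            continue
        if row[0].startswith("#"):
            # Header line, e.g. "#rname  startpos  endpos ... meandepth ..."
            header = [name.lstrip("#") for name in row]
            continue
        if header is None:
            raise ValueError("no header line found before data rows")
        record = dict(zip(header, row))
        length = int(record["endpos"]) - int(record["startpos"]) + 1
        weighted_sum += float(record["meandepth"]) * length
        total_length += length
    if total_length == 0:
        raise ValueError("no data rows found")
    return weighted_sum / total_length
```

Calling this once per sample (for example on an open file handle) gives ~600 numbers, which is easy to plot as a histogram when looking for samples with unusually low or high coverage.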
