Read mapping statistics
1
0
Entering edit mode
3.5 years ago
kathrynm • 0

Hello!

I am looking at the read mappings for my taxa and want to make sure I correctly understand what each column represents.

In rPHG, what are the taxa_count and range_count columns generated from readMappingsForLineName?

In the output txt file for each taxon ending with _additionalMappingStats, what are count, totalNM, totalAS, totalDE and listOfStartPos?

If there is already documentation on this that I missed, please let me know! Thanks! Kathryn

phg • 803 views
ADD COMMENT
2
Entering edit mode
3.5 years ago
zrm22 ▴ 40

For rPHG - range_count is the number of reads mapping to a given reference range. taxa_count is the number of those reads mapping to a specific taxon in the range.

For the _attionalMappingStats files, they are just counts of some of the mapping statistics coming from the BAM/SAM files. 'count' is the number of reads which hit a given haplotype. 'totalNM', 'totalAS' and 'totalDE' are the cumulative sum of the NM, AS and DE fields in the BAM/SAM file for a given haplotype.

Finally listOfStartPos are the alignment start positions for each read with respect to the haplotype. Basically it allows you to look at how the reads are distributed along the sequence of the haplotype.

ADD COMMENT
0
Entering edit mode

Thank you!

ADD REPLY
0
Entering edit mode

A small educational note: if an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one answer if they all work. If an answer was not really helpful or did not work, provide detailed feedback so others know not to use that answer.
upvote_bookmark_accept

ADD REPLY

Login before adding your answer.

Traffic: 1573 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6