Entering edit mode
10.0 years ago
2012201024
▴
20
When using BWA/samtools, does anyone know how to output the SNP number and math length (Minimum seed length) of each reads?
For example, BOWTIE returns something like this for each result: 64:G>N,66:G>N (SNP number: 2) or if some other tools can solve.
Many thanks in advance
Doesn't BWA produce MD auxiliary tags? This information is stored in there (likewise with bowtie2 or bowtie1 when using the SAM format).
Thanks a lot.
I have got MD tags through:
Furthermore, I want to analysis the SNPs number (even Nucleotide substitution rate) of each reads compared with the reference.
Are there some python scripts which can solve this?
Thanks a lot.
I have got MD tags through:
Furthermore, I want to analysis the SNPs number (even Nucleotide substitution rate) of each reads compared with reference.
Are there some python scripts which can solve this?
Perhaps someone has written such a tool, but if not it wouldn't be difficult for you to do so.
If I want to calculate nucleotide substitution rate, can I count the number of "A,T,C,G" in MD, divide by M+number of nucleotide substitution? Should I consider the length of InDels? Thanks!
That's what I'd do. You can ignore indels if you just want the substitution rate.