Hello
I have MiSeq paired-end samples that I aligned before using the classical approach (bwa aln
, bwa sampe
, bwa fixmate
) and that I realigned using bwa_mem
.
Once the bam files generated and sorted I run samtools flagstat
to get the difference between the two results and here is what I found :
bwa
3685558 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
452652 + 0 mapped (12.28%:nan%)
3685558 + 0 paired in sequencing
1842779 + 0 read1
1842779 + 0 read2
158326 + 0 properly paired (4.30%:nan%)
426744 + 0 with itself and mate mapped
25908 + 0 singletons (0.70%:nan%)
139248 + 0 with mate mapped to a different chr
80182 + 0 with mate mapped to a different chr (mapQ>=5)
and bwa mem
:
3937425 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
3216740 + 0 mapped (81.70%:nan%)
3937425 + 0 paired in sequencing
1968780 + 0 read1
1968645 + 0 read2
291160 + 0 properly paired (7.39%:nan%)
2523397 + 0 with itself and mate mapped
693343 + 0 singletons (17.61%:nan%)
2097921 + 0 with mate mapped to a different chr
1488875 + 0 with mate mapped to a different chr (mapQ>=5)
Checking the mapped reads only using samtools -c -F4
I found 3216740 for the method with bwa mem
and 452652 for the classical bwa
method
Although this points to that bwa mem
is much better, I find it strange that the mate mapped to different chr is huge with bwa mem in comparison to bwa.
Any idea on the significance of such difference ? on some locations I see huge coverage in comparison to the alignment generated with the classical bwa version
Thanks in advance
That's actually 15% (bwa mem) mapped to different chromosome against 30% (bwa), which makes of bwa mem result better than the first one, am I correct ? especially with high % of mapped reads