I am running different read alignments and these are the results I have got.
The columns are reference sequence name, sequence length, # mapped read-segments and # unmapped read-segments
The code used to execute the programs is without any option applied. Just as show in the synopsis of their respectively documentation
Someone could explain why being BWA-SW similar to BWA-MEM, results are so different in term of execution time and number of unmapped reads? Based on documentation, I was expecting a long execution time for BWA-backtrack as my read are around 150bp. However, this was 1 second faster.
Thanks for your answer.
I think there is something wrong in these data or something I am missing. I have run samtools flagstat on these BAM files and the results do not match with the table I have generated. The % mapped reads are (in the same order
All match apart from BWA-SW. Why percentage of mapped is lower when there is 0 unmapped reads in the table I generated with samtools idxstats??? I expected 100??
reproducing statistics is always a mystery - all it takes just one slight corner case to throw things off, a bug etc.
I would not worry too much as long as the results are close, bwasw as far as I know is an obsolete methodology anyway, why bother spending effort there. One way to get the odd result is that unmapped reads are reported as secondary alignments. After all every read could align in a secondary fashion.
the rabbit holes are always deep, not worth going down unless it is part of the main analysis
I found the problem.At the end of the results provided by idxstats, unmapped reads with not chr identified are shown. I missed this last line. And there is the small proportion of unmapped read that was found by flagstat