Hi,
I just run tophat2 with bowtie1 on the same fastq files with the exact same settings.
Then i am comparing the two bam files. I used cmp command in unix and it keeps telling me that
bam1 bam2 differ: byte 27, line 1
When I look at the first line, i do not see any difference at all.
Is it expected to have this difference between two bam files even when obtained with the same exact settings?
How can i actually see what is the different bit between the two bam files?
cool, i just did that and this time i found the bitwise flags to be different. (one is 417 and the other is 161). Why there is a difference do you think?
That's a multimapping read with (presumably) multiple reported alignments. If all of the alignments are equal, then one can just be randomly chosen as being primary. Presumably, then, this is just a difference in the alignment assigned as primary.
No, bowtie2 has deterministic output by default. Every aligner will be different here. If you need the exact same results for CI testing then just use a small file without multimappers.
Jvarkit is the mightiest. Hi Pierre. Do you think it's possible for CmpBams to give leftmost coordinates difference for certain read as well as CIGAR in two different alignment? I am trying to map clipped reads to find some circle split reads.
Convert the files to SAM (or uncompressed BAM), there are simply too many caveats when thinking about byte by byte comparisons of compressed files.
cool, i just did that and this time i found the bitwise flags to be different. (one is 417 and the other is 161). Why there is a difference do you think?
That's a multimapping read with (presumably) multiple reported alignments. If all of the alignments are equal, then one can just be randomly chosen as being primary. Presumably, then, this is just a difference in the alignment assigned as primary.
That explains all. Thanks for pointing that
Is there a way of setting a seed on this??
No, bowtie2 has deterministic output by default. Every aligner will be different here. If you need the exact same results for CI testing then just use a small file without multimappers.