Hi All,
I aligned reads of a viral vector using both Bowtie2 local and BWA-MEM. 1300 reads are mapped to this position.
I observed that BWA shows an insert of 41 bases (in the ITR region), the mapping quality of this read is 60, AS:i:91 and CIGAR string of 53S91M41I7M27I4M.
While bowtie2 local does not show this variant and Mapping Quality is 60, AS:i:182 and CIGAR string 53S91M79S.
I believe bowtie2 softclipped these bases while BWA introduced insert to obtain higher alignment score.
I dont believe this variant is true. Can you please help me understand this, which alignment should I consider to make accurate variant call?
Thanks in advance !!
Hi swbarnes2,
I dont understand what you mean to say. I understand BWA is introducing inserts instead of soft clipping. I dont know why this is happening. Also, this region is ITR and contains CCCGGG repeats.
Does BWA forcefully introduce inserts to obtain high alignment score?
How many reads do you have that are spanning this region (i.e. depth of support for this observation)?
Why does your read need so much end clipping? Do all your reads need that much clipping?
The total depth at this location is 1300 (Reference Reads + Alternate Reads). The variant is found at 10% frequency. This reads map to the ITR regions. Some bases are automatically softclipped by the aligners. Also, the variants are located in the reverse reads.