Question

Bwa Alignment Length

0

Entering edit mode

13.1 years ago

Newvin ▴ 360

After performing a transcriptome assembly, I am using BWA to align the reads to my assembled contigs. From the resulting SAM file, I am able to calculate the # of reads that map to each contig, but I would like to normalize this abundance measure by using the length of each alignment.

My naive question: is the length of each alignment equal to the length of the read that is being aligned, or does BWA sometimes align only part of a sequence query to the target?

bwa alignment • 3.9k views

ADD COMMENT • link updated 8.7 years ago by Biostar 20 • written 13.1 years ago by Newvin ▴ 360

0

Entering edit mode

What sequencing platform are you using? Your read lengths vary that much?

ADD REPLY • link 13.1 years ago by Damian Kao 16k

0

Entering edit mode

These are Illumina short reads. The variation in read length occurs because we used an adaptive algorithm to trim the reads based on quality score prior to assembly.

ADD REPLY • link 13.1 years ago by Newvin ▴ 360

score 2 · Answer 1 · 2011-12-09

2

Entering edit mode

13.1 years ago

Jeremy Leipzig 22k

BWA is performing a local alignment. If you see an 'S' in the CIGAR string of the resulting SAM file (e.g 40M36S), that 3' soft clipped end is not part of the alignment.

ADD COMMENT • link 13.1 years ago by Jeremy Leipzig 22k