How Does Bwa Report Multi-Mapped Reads
1
0
Entering edit mode
10.8 years ago
bbio ▴ 90

I am mapping some RNA-seq data with bwa and would like to do some analysis on where multi-mapped reads fall.

I know that I can extract multi-mapped reads by looking for mapq < 23 and/or the XA flag on the reads. However, I am wondering how bwa decides which location to report for a read that can be mapped to two different locations equally well. Does it choose a random one? Does it always report the first one? Something else?

Does anybody know what exactly bwa does here?

bwa mapping • 7.2k views
ADD COMMENT
3
Entering edit mode
10.8 years ago

A random location is selected.

Also see this post, although not specifically bwa related but it goes to show that things can go wrong:

ATTENTION: bowtie2 and multiple hits

ADD COMMENT
0
Entering edit mode

Do you have any idea if the MAPQ will be 0 in case of multiple mapping or something else? (I read two opinions about that)

ADD REPLY
0
Entering edit mode

The MAPQ=0 is a convention that bwa uses and not a standard.

And even considering it a convention it is not quite right. Having a multi-mapped read does not mean that the chance of the alignment being correct is zero.

The best way to detect multimapping is the check the SAM tag for alternative mappings.

ADD REPLY

Login before adding your answer.

Traffic: 2015 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6