Do We Have A Read Expression Summarization Tool For Bwa?
1
1
Entering edit mode
10.7 years ago
DoubleDecker ▴ 180

Is there any tool that can help with ambiguous (multimapping) reads in BWA - assign them to more probable transcript based on such things e.g. as mapping depth. We have RSEM, express but they all for some reason seem to be geared towards Bowtie... so is there any other solution?

bwa • 2.8k views
ADD COMMENT
1
Entering edit mode
10.7 years ago
Birdman ▴ 20

Express works well following BWA in my experience. You could also try Samtools. e.g. samtools view -c -F 4

ADD COMMENT
0
Entering edit mode

I tried eXpress but it does not seem to count the reads correctly, summed counts total is significantly lower than what I get when I look at output from samtools flagstat ran on the same reads but mapped without using the -a option in bwa-mem.

ADD REPLY
0
Entering edit mode

Did you try using the same bwa-mem options to compare both methods?

ADD REPLY
0
Entering edit mode

Well, yes, I ran samtools and express on the same input file. In fact, I had a look at express mailing list where they admit that express is not compatible with BWA yet. You can find a script there which should be able to convert bwa-aln output to bowtie output and thus work with both RSEM and express but again, it does not work with bwa-mem.

ADD REPLY
0
Entering edit mode

Hummm... that's surprising. They should state it on their website/documentation if they are aware of this incompatibility. All they say now about the output is "eXpress also requires a file, multiple files, or a piped stream of SAM or binary SAM (BAM) alignments as input. The SAM alignments should be generated by mapping your sequencing reads to the target sequences specified in the multi-FASTA input file described above. For more details on the SAM format, see the specification. Many short read mappers including Bowtie, Bowtie2, BWA, and MAQ can produce output in this format."

ADD REPLY
0
Entering edit mode

I also compared HTSeq-count and eXpress and the results (read counts) are very different. Not sure which I should trust... I get in total 35% less reported reads with htseq but some of them are higher, some are lower. This is very strange taking into account that I'm using the same SAM files.

ADD REPLY
0
Entering edit mode

I asked the question on another forum since nobody here seems to have an answer... (sorry about that)

ADD REPLY

Login before adding your answer.

Traffic: 1678 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6