Why is eXpress producing strangely low counts?
2
0
Entering edit mode
5.8 years ago
bagi.m ▴ 10

I have aligned our RNA-seq to reference transcriptome (there is no genome) using bwa mem. The reference transcriptome is from the same species but from different population, so some mismatches and indels are expected.

Then I used eXpress to count reads.

The results are strangely low. No reference contig has more than 60 tot_counts. However, when I look at the alignment in tablet or IGV some contigs have up to 500000 reads aligned to them. Moreover, uniq_counts is equal to tot_counts for all contigs. But cca 0.1% of reads in my bam file are multimapped.

Any suggestions what could be wrong?

RNA-Seq • 1.0k views
ADD COMMENT
2
Entering edit mode
5.8 years ago
bagi.m ▴ 10

Ok, this one is embarrassing. I completely misunderstood the eXpress documentation regarding sorting the input .bam file and the fact that you are _not_ supposed to sort it.

ADD COMMENT
1
Entering edit mode
5.8 years ago

Use kallisto instead. The creators of eXpress recommend as a better alternative.

As for what is going on - well I will say that it is difficult to track down not know what the data looks like - and since there are much better alternatives it is not quite worth doing so

For organisms that have significant splicing going on a lot many more of your reads should be multi-mapping in that case something is off. If your organism does not have splicing then there is no reason to use eXpress.

ADD COMMENT
0
Entering edit mode

Thank you for your answer. I am indeed planing to try Salmon with this data, and see if I get to similar conclusion. I found some articles / posts claiming pseudoalignment tools are better, but both were based on human data. So I don't know if this claim holds for organisms with much higher intraspecies sequence variability (where tweaking alignment parameters is sometimes necessary).

ADD REPLY
1
Entering edit mode

Pay attention because Salmon- like eXpress - doesn't want sorted input.

ADD REPLY

Login before adding your answer.

Traffic: 1601 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6