Question

How does Salmon handle multimapped reads?

0

Entering edit mode

3 months ago

marionette.kent • 0

Hi all,

It's a bit of a bait question but also something I want to ask. I have been using the default nf-core pipeline for rnaseq analysis and the output was always fine for DGE. However, recently I started using ENCODE gtf instead because of the warning on the nf-core that the AWS iGenome option may not be the most accurate. Now I have genes of the same name but multiple ensembl ids.

Here comes the question: if I want to just add the counts of the genes sharing the same name, I will need to know if multimapped reads are counted mutliple times, once, or dropped off. When aligning with star, they didn't use the flag --outFilterMultimapNmax. So it is set 10, and the output should have tagged the reads that are multimapped. Then I quantify it with salmon, using the star output. I have read through the document but there wasn't any explanation as to whether salmon is aware of these multimapped reads as a quantifier, or how they would handle it if they know it is multimapped.

Would anyone be able to point me to the right document or explain how it works to me? Thanks a lot

rnaseq multimapping nf-core salmon • 608 views

ADD COMMENT • link updated 3 months ago by GenoMax 149k • written 3 months ago by marionette.kent • 0

score 1 · Accepted Answer · 2024-12-03

1

Entering edit mode

3 months ago

GenoMax 149k

This was discussed in: Big differences between mappings computed by Salmon and quantification

@Rob (developer of salmon) has provided clarifications about salmon in
Mapping vs Quasi-Mapping
Could you explain the difference between STAR, KALLISTO, SALMON etc. to experimental Biologist/non-bioinformatician
https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-02151-8

ADD COMMENT • link 3 months ago by GenoMax 149k

0

Entering edit mode

Thanks GenoMax I have read those post but I was under the impression those were referring to the quasi-mapping mode? Or have I misunderstood and in fact both the quasi-mapping mode and alignment mode make use of the same EM model for quantification with the difference of a pseudoalignment input and an actual alignment input?

ADD REPLY • link 3 months ago by marionette.kent • 0

1

Entering edit mode

Rob confirmed that salmon accounts for multimapping in both modes.

ADD REPLY • link 3 months ago by GenoMax 149k

0

Entering edit mode

Thank you very much for the answer and the patience!

ADD REPLY • link 3 months ago by marionette.kent • 0