Hi All
I have 3 samples that their alignment was around 70% using Tophat. But with Salmon the alignment dropped to 40 to 50%. The warning was 0.0043844% of fragments were shorter than the k used to build the index (31) And this showed up with samples with good alignment. I changed the k used, but the alignment still 40-50% What do you think could be the reason?
Thanks
There is a difference between alignment and mapping. You are "mapping" with Salmon.
Sharon : You should stop using TopHat for new projects and not use the results as a metric to compare either. STAR, BBMap, HISAT2 etc. are current RNAseq aligners or you could do mapping with salmon/kallisto. Pay particular attention to multi-mapping reads. Each aligner handles them differently. Check is you have rRNA contamination in your data as well.
Got it. Thanks genomax so much. Much appreciated !
As genomax mentions, if you map reads to the genome with a spliced mapper, but salmon doesnt map these reads to the transcriptome, tge big potential culprits are rRNA, intronic retention, or some other sort of genomic contamination. If it comes from an annotated transcript in the reference transcriptome, salmon should be able to map it if the other tools could.
Where did you get the transcript reference (fasta) from?
From here: https://www.ncbi.nlm.nih.gov/projects/genome/guide/human/index.shtml