Hi All,
In my current work, I am using Kallisto in 3 prime RNA seq data (read length 55bp) from melanoma samples. I am ideally expecting quantification mapped to only one transcript of my gene (canonical) as Quantseq 3' seq quantifies one read per transcript. Unfortunately, I can't see the canonical transcript (longest length here) in quantified output where I could see mapping to other few transcripts which is not the ideal case. Could anyone please explain if Kallisto can be used in my case, and if so why I am not getting abundance for canonical transcript (longest length)?
Why would you make such an assumption? Why would you want to?
As I mentioned, I assume that the longest isoform (here canonical transcript) should contain the sequences that cover the alternative transcripts. So while Kallisto quantification I should ideally see abundance to my longest transcript as it also contains the sequences of alternative transcripts. The reason why I want only canonical transcripts is to quantify the abundance at the gene level and not at the transcript level.
I don't think that's a fair assumption at all.
I see, Can you explain the reason why it is not fair to take the canonical transcripts?
You should read how kallisto assigns reads. You should also realize that the longest transcript isoform doesn't necessarily include all other exons in a given gene.