Entering edit mode
2.6 years ago
isha.lily20
▴
10
Hello
Is there any direct tool to fetch intron coordinates from transcriptome.gtf file ?
Is there any direct tool available ?
Thank you
It’s simple to parse. Simply take the coordinates annotated as
.`gene`transcript
and those annotated asexon
. Take the complement, sogenetranscript minus exon, that’s the introns.If you google the question there are multiple threads on that here at biostars already.
If you want the exact intron ranges you would need to modify the above by looking per transcript. Doing it per gene would give you regions contained within the introns of all transcripts, but not proper introns per say.
Indeed, good catch!
i have done intergenic minus exon for intron coordinates, that is wrong ?
In a sense, yes it is wrong because it doesn't capture biological complexity of alternative splicing. If you have simple gene models where each gene has a single annotated transcript, then it would suffice, but only because the gene models do not capture the real biology well.
Imagine these hypothetical transcripts for the same gene:
If you look at genes in a genome browser you will find even more complex cases. Therefore, it becomes very clear that introns depend on the actual transcript.