how to find coordinates of 5prime and 3prime utr from transcriptome data generated after cuffmerge only contains exon in gtf
1
0
Entering edit mode
2.5 years ago
isha.lily20 ▴ 10

Hello

I am beginner in computational field, please help me to find coordinates of 5primeUTR & 3primeUTR from transcriptome data generated after cuffmerge contains only exon in GTF file, screen short attached

Do I need direct scripts or is any other direct software/ tool available?

Thank you so much, please help, suggestions required

file:///home/scis/Desktop/Screenshot%20from%202022-04-20%2011-58-44.png

transcriptome cuffmerge • 658 views
ADD COMMENT
2
Entering edit mode
2.5 years ago

As far as I am aware, none of the standard transcript assembly tools (cufflinks/stringtie etc) have the ability to annotate the open reading frame within a denovo assembled transcript, they simply say which part of a the genome is transcribed and which is not.

We have attempted to do this in the past in two different ways. Neither approach is 100% perfect. I can also think of a third approach that might work, but we havn't tried.

  1. Obtain the sequence of each of the newly assembled transcripts, probably as a Fasta file, and either write a script or use one of the existing tools, to find the first or longest ORF within the sequence. The UTRs are then the sequences before or after this location. E.g. https://www.ncbi.nlm.nih.gov/orffinder/ or https://stackoverflow.com/questions/31757876/python-find-longest-orf-in-dna-sequence
  2. If you have an already annotated reference transcriptome, and you are just looking for additional transcripts, for each novel transcript you can look for a reference transcript with an identical intron chain within the coding sequence, and whose ATG and stop codon is the within the novel transcript and transfer over the coordinates
  3. If there are protein annotations for this, or a related speices, you can use tblastn to align your novel transcript sequences against the protein database, and identify the open reading frames that way.
ADD COMMENT

Login before adding your answer.

Traffic: 1675 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6