Protein Coding Region Of Reference Assembled Transcript
11.8 years ago
The ▴ 180

Hello All,

I have some RNA transcript sequences that were generated by 454 sequencing and assembled using sequences from a reference genome. For each transcript I have its sequence and the GenBank ID of the sequence used for its assembly, but I don't have access to the assembly data itself.

How can I predict the protein-coding sequence of each transcript? Do I need to align the transcripts to the reference sequence first? Is there a standard protocol that people follow, and are there software tools for doing this?

The sequences very likely contain some frameshift errors, so any method for detecting or correcting them would be much appreciated.

Many Thanks

transcript translation • 2.1k views
11.8 years ago
Pappu ★ 2.1k

For microbial genomes, Glimmer can recognize coding regions. For eukaryotic genomes, look at the papers on splice site recognition, e.g. those by Gunnar Rätsch.
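For an already-assembled transcript, a simple first pass before turning to dedicated gene finders is to scan the reading frames for the longest open reading frame (ORF). The following is only a minimal sketch of that idea, not a method taken from any particular tool: the function names (`translate`, `longest_orf`) are made up for illustration, it assumes the standard genetic code and an ATG start codon, and it uses no external libraries.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def longest_orf(seq, both_strands=False):
    """Return (protein, strand, frame) for the longest M-initiated ORF.

    An ORF that runs off the 3' end without a stop codon is still reported,
    since assembled transcripts may be partial.
    """
    seq = seq.upper().replace("U", "T")
    strands = [("+", seq)]
    if both_strands:
        strands.append(("-", reverse_complement(seq)))

    best = ("", "+", 0)
    for strand, s in strands:
        for frame in range(3):
            # Stop codons translate to '*', so splitting on '*' yields
            # the stretches between consecutive stops in this frame.
            for segment in translate(s[frame:]).split("*"):
                start = segment.find("M")
                if start != -1 and len(segment) - start > len(best[0]):
                    best = (segment[start:], strand, frame)
    return best


if __name__ == "__main__":
    protein, strand, frame = longest_orf("GGATGGCCAAATAACC")
    print(protein, strand, frame)
```

Note that this scan does not repair frameshifts: an insertion or deletion in the read data would split the true coding region across two frames, so an unexpectedly short ORF that continues in a different frame is a hint that such an error is present.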

11.8 years ago
The ▴ 180

Hi, my sequences are of eukaryotic origin. The assembly was guided by mRNA sequences, not by genome sequences. Thanks.
