Question

Protein Coding Region Of Reference Assembled Transcript


12.6 years ago

The ▴ 180

Hello All,

I have some RNA transcript sequences that were produced by 454 sequencing and assembled using sequences from a reference genome. For each transcript I have its sequence and the GenBank ID of the reference sequence used for its assembly. I don't have access to the assembly data.

How can I predict the protein-coding sequence of each transcript? Do I need to align the transcripts to the reference sequence first? Is there a particular protocol that people follow? Are there any software tools for doing this?

The sequences almost certainly contain some frameshift errors, so any method for correcting them would be much appreciated.

Many Thanks

transcript translation • 2.4k views


score 0 · Answer 1 · 2013-01-24


12.6 years ago

Pappu ★ 2.1k

For microbial genomes, Glimmer can recognize coding regions. For eukaryotic genomes, look at the papers on splice site recognition, e.g. those by Gunnar Rätsch.


score 0 · Answer 2 · 2013-01-24


12.6 years ago

The ▴ 180

Hi, my sequences are of eukaryotic origin. The assembly was guided by mRNA sequences, not by genome sequences. Thanks
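Since the assembly was guided by mRNA, and assuming the transcripts are already spliced, one rough first pass is to translate all six reading frames and keep the longest open reading frame that starts with a methionine. The sketch below is only an illustration in plain Python: it assumes the standard genetic code, the function names (`reverse_complement`, `translate`, `longest_orf`) are made up for this example, and it does nothing to detect or repair frameshift errors, which would split a true ORF across frames.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def longest_orf(seq):
    """Return (protein, strand, frame) for the longest M-initiated ORF.

    All six frames are scanned. An ORF running off the end of the
    sequence without a stop codon is still reported, because an
    assembled transcript may be partial.
    """
    seq = seq.upper().replace("U", "T")
    best = ("", "+", 0)
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            protein = translate(s[frame:])
            # Each stop-delimited segment can hold at most one candidate ORF,
            # starting at its first methionine.
            for segment in protein.split("*"):
                start = segment.find("M")
                if start != -1 and len(segment) - start > len(best[0]):
                    best = (segment[start:], strand, frame)
    return best


if __name__ == "__main__":
    print(longest_orf("CCATGGCTTAAGG"))
```

A transcript whose best ORF is much shorter than the protein encoded by its guiding GenBank reference is a candidate for a frameshift, which is where comparing the transcript against the reference protein becomes useful.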
