Getting the mRNA sequence from genome annotation
2
0
Entering edit mode
4.3 years ago
colin.kern ★ 1.1k

From my understanding, if an annotated gene from Ensembl, UCSC, etc. is marked as being on the positive strand, that means its sense strand is the exact sequence from the assembly fasta file, and so the mRNA sequence is a copy of the genome assembly sequence for the exon regions (except with uracil replacement). For a gene on the negative strand, then, the reverse complement of the fasta sequence needs to be used to get the mRNA sequence. Is this correct, or do I have this backwards?

sequence gene Assembly • 824 views
ADD COMMENT
0
Entering edit mode
4.3 years ago

Yes that is correct assuming you want the exonic sequence in 5' to 3' order relative to the mRNA. Also note RNA-editing or alternative splicing may change the sequence from the genome reference.

ADD COMMENT
0
Entering edit mode
4.3 years ago

Realistically, if you want a transcript from an annotated organism in ensembl, just get the transcript. Transcripts and exons are always returned in the desired orientation, no matter what strand the transcript is on.

ADD COMMENT

Login before adding your answer.

Traffic: 2063 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6