Question

Open Reading Frame And Codons

2

Entering edit mode

13.2 years ago

Wilmer Garzon ▴ 50

How I can find the ORF in a sequence using Python? also I need find all codons.

Thanks.

orf sequence python codon • 7.0k views

ADD COMMENT • link 13.2 years ago by Wilmer Garzon ▴ 50

score 7 · Answer 1 · 2012-02-08

7

Entering edit mode

13.2 years ago

Nikolay Vyahhi ★ 1.3k

List of all codons:

genome = 'ACGTACGT....'
print map(lambda x: ''.join(x), zip(genome[0:], genome[1:], genome[2:]))

Set of all codons:

genome = 'ACGTACGT....'
print set(map(lambda x: ''.join(x), zip(genome[0:], genome[1:], genome[2:])))

If your genome is large, use itertools.izip instead of zip:

import itertools
itertools.izip(genome[0:], genome[1:], genome[2:])

To find ORF it's better to use Biopython (see zev.kronenberg's link).

ADD COMMENT • link 13.2 years ago by Nikolay Vyahhi ★ 1.3k

0

Entering edit mode

If you genomes are large, use itertools.izip instead of zip: import itertools; itertools.izip(genome[0:], genome[1:], genome[2:])

ADD REPLY • link 13.2 years ago by Nikolay Vyahhi ★ 1.3k

Ram · Answer 2 · 2012-02-08

5

Entering edit mode

13.2 years ago

Zev.Kronenberg 12k

Googled:

ADD COMMENT • link updated 5.6 years ago by Ram 45k • written 13.2 years ago by Zev.Kronenberg 12k

Ram · Answer 3 · 2012-02-09

0

Entering edit mode

13.2 years ago

Wilmer Garzon ▴ 50

Thanks,

I worked with the fasta file NC_005816.fna following the steps indicated in (http://biopython.org/DIST/docs/tutorial/Tutorial.html#htoc224), but if I compare this ORF's with the obtained using toolbox WEB of NCBI http://www.ncbi.nlm.nih.gov/projects/gorf/orfig.cgi the results are differents, why?

I need to use python because the file of my sequence is 4GB. I can't use NBCI toolbox.

Thanks.

ADD COMMENT • link updated 5.6 years ago by Ram 45k • written 13.2 years ago by Wilmer Garzon ▴ 50

0

Entering edit mode

would be more appropriate as a comment. possible reason for difference: different genetic codes.

ADD REPLY • link 13.2 years ago by Michael 55k