Find the first ORF of multiple fasta sequences using biopython

0

Entering edit mode

4.6 years ago

NewtoBioinfo • 0

I'm new to bioinformatics. I want to find and translate the first orf (not necessarily the longest) for each FASTA sequence in a multi-FASTA input file and write the translated protein sequence with the corresponding header and frame information to a new file. Could someone please give me the code to do this using Biopyhton?

python translate fasta • 996 views

ADD COMMENT • link 4.6 years ago by NewtoBioinfo • 0

0

Entering edit mode

We're not simply going to give you the code (it's against the forum's policy) but we can help with pointers and tips ;-)

to start, you should check (or only start extracting sequences) when they start with an 'ATG' (= M ), substract from string length 3 .... if the ORF does not, skip it and go to next

ADD REPLY • link 4.6 years ago by lieven.sterck 15k

0

Entering edit mode

Hi NewtoBioinfo ,

Please do not delete posts, especially when they already have received input/answers. This is not good etiquette of this forum.

thanks.

ADD REPLY • link 4.6 years ago by lieven.sterck 15k

Login before adding your answer.