Find the first ORF of multiple fasta sequences using biopython
0
0
Entering edit mode
3.9 years ago

I'm new to bioinformatics. I want to find and translate the first orf (not necessarily the longest) for each FASTA sequence in a multi-FASTA input file and write the translated protein sequence with the corresponding header and frame information to a new file. Could someone please give me the code to do this using Biopyhton?

python translate fasta • 870 views
ADD COMMENT
0
Entering edit mode

We're not simply going to give you the code (it's against the forum's policy) but we can help with pointers and tips ;-)

to start, you should check (or only start extracting sequences) when they start with an 'ATG' (= M ), substract from string length 3 .... if the ORF does not, skip it and go to next

ADD REPLY
0
Entering edit mode

Hi NewtoBioinfo ,

Please do not delete posts, especially when they already have received input/answers. This is not good etiquette of this forum.

thanks.

ADD REPLY

Login before adding your answer.

Traffic: 1643 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6