Entering edit mode
3.9 years ago
NewtoBioinfo
•
0
I'm new to bioinformatics. I want to find and translate the first orf (not necessarily the longest) for each FASTA sequence in a multi-FASTA input file and write the translated protein sequence with the corresponding header and frame information to a new file. Could someone please give me the code to do this using Biopyhton?
We're not simply going to give you the code (it's against the forum's policy) but we can help with pointers and tips ;-)
to start, you should check (or only start extracting sequences) when they start with an 'ATG' (= M ), substract from string length 3 .... if the ORF does not, skip it and go to next
Hi NewtoBioinfo ,
Please do not delete posts, especially when they already have received input/answers. This is not good etiquette of this forum.
thanks.