Question

ORF translation of a huge number of DNA sequences

0

Entering edit mode

7.5 years ago

l.souza ▴ 80

I'm looking for a software which I can translate the open reading frame of a lot of sequences at the same time. I've tried to use EMBOSS Sixpack, but even in the local software there's a limit of sequences to input. What can I use?

DNA sequence translation ORF • 2.6k views

ADD COMMENT • link updated 7.5 years ago by h.mon 35k • written 7.5 years ago by l.souza ▴ 80

2

Entering edit mode

Do you have to extract the ORF, or your dna sequences are ATG-STOP already? Eitherway, Biopython can make quick work of it. Also, see this post.

ADD REPLY • link 7.5 years ago by st.ph.n ★ 2.7k

0

Entering edit mode

I need to extract the ORF! Can I do this with Biopython?

ADD REPLY • link 7.5 years ago by l.souza ▴ 80

0

Entering edit mode

You will need to know a little more about your sequences. Are the ORFs in the forward or reverse direction? If reverse, you'll need to take the reverse complement, and then find 'ATG' in a window search of 3 nucleotides. Do you happen to know the approximate length of the ORFs? You will have to then search for stop codons. With finding 'ATG', you can then translate in the forward direction from the first position.

ADD REPLY • link 7.5 years ago by st.ph.n ★ 2.7k

0

Entering edit mode

They are in the forward direction. They have something about 7000nt.

ADD REPLY • link 7.5 years ago by l.souza ▴ 80

0

Entering edit mode

see Emboss sixpack replacement : download & install the standalone version.

ADD REPLY • link 7.5 years ago by Pierre Lindenbaum 164k

score 0 · Answer 1 · 2017-06-06

0

Entering edit mode

7.5 years ago

h.mon 35k

You can split your file into smaller files with GenomeTools (gt splitfasta -numfiles 60 seqs.fasta), faSplit by Jim Kent or fasta-splitter by Kirill Kryukov, then loop or parallel through the files.

ADD COMMENT • link 7.5 years ago by h.mon 35k

0

Entering edit mode

Sorry, but I don't understand why I should split my files into smaller ones...

ADD REPLY • link 7.5 years ago by l.souza ▴ 80

0

Entering edit mode

but even in the local software there's a limit of sequences to input

ADD REPLY • link 7.5 years ago by h.mon 35k

0

Entering edit mode

I mean a limit in the number of sequences.

ADD REPLY • link 7.5 years ago by l.souza ▴ 80

1

Entering edit mode

fasta-splitter and faSplit can split by number of sequences.

I thought gt could split by number of sequences, apparently it can't.

ADD REPLY • link 7.5 years ago by h.mon 35k

0

Entering edit mode

I'm gonna try it... But I would be glad if I'd find something to run all at once.

ADD REPLY • link 7.5 years ago by l.souza ▴ 80