Question

NCBI ORFfinder - How Does It Work

0

Entering edit mode

6.0 years ago

A Soggy Waffle • 0

NCBI help desk hasn't been very helpful to me on this so far.

I'm looking for any record or docs on the ORFfinder tool. I want to know how it finds ORFs. Not how to use it, but how it determines what is and is not an ORF. Is it regex, HMM, fairy dust? Maybe such docs are posted somewhere but I can't seem to find them.

Thanks

ncbi • 12k views

ADD COMMENT • link updated 6.0 years ago by h.mon 35k • written 6.0 years ago by A Soggy Waffle • 0

1

Entering edit mode

Download the program from here. gunzip it and add execute permissions chmod a+x ORFfinder. Run it to look at in-line help.

ADD REPLY • link 6.0 years ago by GenoMax 151k

0

Entering edit mode

I guess I wasn't clear enough. It's not the arguments and calling the program that's the problem. I want to know how it finds ORFs, algorithmically.

ADD REPLY • link 6.0 years ago by A Soggy Waffle • 0

1

Entering edit mode

If this is the ORF finder I think it is, I don’t think it’s much more sophisticated than finding start and stop codons in all 3 frames on each strand, and highlighting the interval as a potential ORF. Possibly some filtering for minimal sizes.

‘Proper’ gene predictors like Prodigal and Glimmer have much more sophisticated characteristics like sequence complexity, homology, etc.

ADD REPLY • link 6.0 years ago by Joe 22k

0

Entering edit mode

See my note below.

ADD REPLY • link 6.0 years ago by GenoMax 151k

1

Entering edit mode

Video above contains an email address of a person who probably still supports this tool. You can try writing to them directly. It does not contain the information you are looking for.

Note 1: ORFfinder is not meant to be a gene prediction tool for eukaryotes.
Note 2: NCBI help desk can take up to 3 working days to respond. Likely because of volume of queries they get. I have always received an answer for my questions.

ADD REPLY • link 6.0 years ago by GenoMax 151k

score 0 · Answer 1 · 2019-05-16

ORF Finder perform six-frame translation of a nucleotide sequence given a particular genetic code, finding all ORFs possible. It then filters for ORFs above a certain size threshold, and may exclude ORFs nested inside longer ORFs. This information (and nothing more) can be found at the paper Database resources of the National Center for Biotechnology.