Hi,
Can i consider an sequence to be full-length cDNA, if the ORF in the sequence aligns with ~99%-100% identity, with 100% coverage to the reference protein? The species i work on does not have any nearest reference.
Could i please know, if there are any tools/scripts available which can run locally to predict full-length cDNA sequences. I know tools like TargetIdentifier and Full-lengther which can run online, but they eat a lot of time (for running BlastX). (edit: bit confused with target identifier terminology (eg: sense complete and partial), full-lengther does not work any more)
I would also like to appreciate if someone guides me in this issue. Pleae spare if i am not able to put properly.
Thanks in advance.
As far I understood, you want to find out genes in your fasta file? Then glimmer is a good option.
@pappu: Thank you, but is not glimmer is especially designed for microbial sequences? I used augustus to do that in that place which gives out the most possible ORF's.
Did you check out the papers by Gunnar Rätsch on ORF predictions?
you mean mGene? My species does not have Genome either to train the model.