Hello everyone,
I would like to ask anyone who know about How to use result from GeneMark-ES program to identify function? For now I already have a result like below of this post. And It's include Nucleotide sequences output but I would like to use Amino acid sequences to identify function. That's the problem that I would like to ask suggestion by anyone who know to solve this problem. I cannot translate those Nucleotide sequence to Amino acid sequence because It's still include intron when just translate Nucleotide sequence to Amino acid sequence.
Output from GeneMark-ES : *Eukariotyc GeneMark.hmm version 3.49 Sequence name: /storage/home/nuthatai.sut/CallORF_Tool/gm_et_linux_64/gmes_petap/output/data/dna.fa_15 FASTA defline: >dna.fa_15 15_dna 1 30316 Sequence length: 30316 bp G+C content: 27.92% Matrices file: /storage/home/nuthatai.sut/CallORF_Tool/gm_et_linux_64/gmes_petap/output/gmhmm.mod Tue Oct 25 17:52:02 2016 Predicted genes/exons Gene Exon Strand Exon Exon Range Exon Start/End # # Type Length Frame 1 2 - Terminal 6981 7014 34 3 3 - - 1 1 - Initial 7171 7244 74 2 1 - - 2 2 - Terminal 16420 16424 5 3 2 - - 2 1 - Initial 16476 16509 34 1 1 - - 3 1 + Initial 26431 26436 6 1 3 - - 3 2 + Terminal 26642 26644 3 1 3 - -
nucleotide sequence of predicted genes
gene_1|GeneMark.hmm|108_nt ATGTCATCCCTTACTTTGCATCAACAGGCCTACTACACGATAGCACCCGCCGGAATGTCC ATTTGGACTGAACGTAAGAAAGGCGACGTCATGACCAAGACAGTATAA gene_2|GeneMark.hmm|39_nt ATGTTTCTACCAAACATCGGATTTAACTCACCAGGATGA gene_3|GeneMark.hmm|9_nt ATGAATTGA
end nucleotide sequence*
Thank you for advance everyone
blastx
it on NCBI's site...? https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastx&PAGE_TYPE=BlastSearch&LINK_LOC=blasthomeAnd be aware, GeneMark only predicts CDS sequences. There's no guarantee that actually is one - and if I'm reading your copied sequences correctly, GeneMark thinks it found 3 that are 108nt or less? I'd say its pretty unlikely that anything that short codes for something meaningful.
I have another CDs predictor that another tool have Amino acid sequence. That's reason why I would like to convert GeneMark-ES output to Amino acid sequences. And I would like to thank you for your help, if you have any suggestion please let's me know.
Thank you
If you just need a tool that does translation for you, use ExPASy: http://web.expasy.org/translate/
Again, note that this is not an exact science for unknown organisms, you'll be relying on a generic version of whatever translation table you choose which may not be exactly right for your organism - whatever it is.
Thank you for your suggestion.