I want to classified predicted genes using the COGs (Clusters of Orthologous Groups of proteins) and KEGG (Kyoto Encyclopedia of Genes and Genomes) databases.
The function annotation of COGs like the publication posted below.
For performance reasons, the online sequence mapper of EggNOG v4.5 is currently limitted to one protein sequence at a time. Multi sequence FASTA files are not allowed.
Therefore, I performed local HMMER searches.
Download all HMM models and build a HMMER database using hmmpress. Finally, use hmmscan to query protein sequences against the database.
The following is my full code:
cat bactNOG_hmm/*.hmm > bactDB.hmmer
hmmpress bactDB.hmmer
hmmscan bactDB.hmmer MyQueryFasta.fa
However, I did not get any function annotation of COGs from the output file format. Can anyone please help me out.