Is there any fast and reliable way to annotate protein sequences from bacterial genomes to the GO terms?
The best option so far seems to install and run a local copy of InterproScan, but its annotations go way beyond my needs and its computationally very expensive. Also, there doesn't seem to be a way to speed-up the analysis to obtain just the GO-terms. Maybe reducing the number of inspected DBs?
Any other tool/database that perform an annotation in reasonable time would be fine.
Thanks
Update: turned out that iprscan version 5 is less aggressive and a bit faster than previous versions: the annotation of ~10'000 proteins took a little more than one day.
Update 2: the new "mapper" feature of EggNOG (at the time of this edit still in beta) seems even better and way faster (code to have a stand-alone version is here).
Hi @mgalactus, did you finally go through InterproScan path? I'm in the same boat. I need GO terms for my bacterial proteins.
Hi, I actually think that the new beta feature of EggNOG could be way faster snd more useful: http://beta-eggnogdb.embl.de/#/app/seqmapper and code here: https://github.com/jhcepas/eggnog-mapper