Map GO Terms to Protein/Gene of E.coli
1
0
Entering edit mode
8.3 years ago
Benni ▴ 30

I have a list of Proteins/Genes and I want to find out, which GO Terms (especially Cellular Component) belong to every Protein. Is there maybe a simple .csv file available online? I looked at Gene Ontology, Uniprot and EcoCyc, but I couldnĀ“t find such a list. Alternatively there could be a online tool, that maps such a list?

Thanks

GO E.coli • 3.3k views
ADD COMMENT
2
Entering edit mode
8.3 years ago
EagleEye 7.6k

Try GeneSCF to retrieve complete list of GO terms with corresponding Genes as simple text file.

./prepare_database -db=GO_all -org=ecocyc

The above command downloads complete GO db as simple text file in following location, 'geneSCF-tool/class/lib/db/ecocyc/' (This will have Biological Process, Molecular Function and Cellular Component as separate files)

Alternatively, you can provide the list of genes to the tool as input. It will cluster all the genes according to GO terms it belongs. Example for Cellular component,

./geneSCF -m=update -i=INPUTgene.list -t=gid -db=GO_CC -o=/ExistingOUTPUTfolder/ -org=ecocyc --plot=yes --background=#NumberOfBackgroundGenes

NumberOfBackgroundGenes=3836 (number derived from total genes from GeneOntology database for E.coli)

ADD COMMENT
0
Entering edit mode

Instead of using the quote tool formatting you may want to switch to code tool (101010 button) for these code examples. It makes it easier to see the code.

ADD REPLY
0
Entering edit mode

Thanks for the information :)

ADD REPLY

Login before adding your answer.

Traffic: 1562 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6