How To Get The Gene Names To Go Terms For Bacteria And Archaea
2
0
Entering edit mode
11.1 years ago
liupfskygre ▴ 210

Hi, I want to use goseq to do GO enrichment of RNAseq data of a bacteria and archaea. But I was stucked by the gene to GO term list. I have try the biomart method here(http://seqanswers.com/forums/showthread.php?t=21829) in R, but unfortunately, biomart now seems not support bacteria now. so how could I fetch a list of gene to GO terms?

Thanks in advance!

Pengfei

go • 6.5k views
ADD COMMENT
2
Entering edit mode
11.1 years ago
Emily 24k

Hi Penfei

Ensembl Bacteria have had to stop supporting BioMart because we've expanded our database so that now we have 9000 genomes and we don't have the resources to support BioMart for that number. You should be able to get the data via the Perl API. There's a free course on using the Ensembl API here. There's a few extra points that you need for accessing prokaryotes that you can learn about here.

ADD COMMENT
0
Entering edit mode

Thank you for your reply, I will learn how to do it.

ADD REPLY
0
Entering edit mode

Thanks, Emily.

Do you have any suggestions on getting GO IDs using ensembl gene ids using R?

I was also stucked at retrieving GO ids (bacteria_ensembl not support biomart anymore).

ADD REPLY
0
Entering edit mode

Just like Pengfei, you'll have to look into the API I'm afraid.

ADD REPLY
1
Entering edit mode
11.1 years ago
Arnaud Ceol ▴ 860

You can try directly from Uniprot. As an example: all GO terms associated to the SecA protein of E.coli:

http://www.uniprot.org/uniprot/?query=gene%3aseca+AND+organism%3a83333&format=tab&columns=id,protein%20names,genes,go,go-id

ADD COMMENT
0
Entering edit mode

Thank you very much!, I will also give it a try!

ADD REPLY

Login before adding your answer.

Traffic: 1971 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6