Genes associated with GO term
1
1
Entering edit mode
7.8 years ago
mforde84 ★ 1.4k

I've read in an older thread that to retrieve all of the gene names associated with a GO id you use the biomaRt package, eg:

library(biomaRt)
ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl")
gene.data <- getBM(attributes=c('hgnc_symbol', 
    'ensembl_transcript_id', 'go_id'), filters = 'go_id', 
    values = 'GO:0072599', mart = ensembl)

However, I'm not sure this is actually a correct answer since it returns 1 gene annotation compared to the 109 reported on www.ebi.ac.uk. Is there a more nuanced interpretation of what this one gene is? Is it only genes directly related to the term and no child terms? If so, is it appropriate to retrieve all child terms for the purpose of functional enrichment analysis, or to just use the 1 gene directly related to the term?

GO gene • 4.8k views
ADD COMMENT
0
Entering edit mode

It looks like it's just the genes that are directly related to the term, and no child terms. In order to get those I had to use:

library(org.Hs.eg.db)
gene_list <- data.frame(mget("GO:0072599", org.Hs.egGO2ALLEGS)[[1]])
ADD REPLY
0
Entering edit mode

When I do this code

gene_list <- data.frame(mget("GO:0072599", org.Hs.egGO2ALLEGS)[[1]])

I get back a list of just numbers, not ensembl IDs. Any idea what these are or how to convert to ensembl?

ADD REPLY
1
Entering edit mode

I'm 5 years late, but these will probably be entrez gene IDs you can then convert to gene symbols or ensembl IDs

ADD REPLY
1
Entering edit mode
7.8 years ago
EagleEye 7.6k

Hi, there is one more simple solution with GeneSCF. You can download all GO terms with associated Genes as tab-separated text file using simple command line. See this example.

In your case for Homo sapiens, use

./prepare_database -db=GO_all -org=goa_human

prepare_database usage

ADD COMMENT
0
Entering edit mode

great thats very helpful

ADD REPLY

Login before adding your answer.

Traffic: 2353 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6