Tools To Find Gene Ontology Term Enrichment
15
27
Entering edit mode
14.7 years ago
Biostar User ★ 1.0k

I need to make a recommendation to people working in a wet-lab looking for an easy to use tool that does GO term enrichment determination. For those unfamiliar with the concept it means that given a list of gene names they want to find out which gene ontology terms are present in numbers that are above random chance.

There is a huge list here yet a random sampling of the tools mentioned there has lead me to many non-working sites. Other tools seem out of date or just not reliable.

What tool do you use to solve this problem?

Thanks

gene enrichment • 63k views
ADD COMMENT
4
Entering edit mode

This 2016 pre-print paper assess the impact of using obsolete Gene Enrichment tools (e.g. DAVID not being updated for years) in the current literature: http://biorxiv.org/content/early/2016/04/19/049288

ADD REPLY
1
Entering edit mode

The list was updated again to this link

ADD REPLY
0
Entering edit mode

Your link to the list of geneOntology-related tools is perhaps not the best given your problem, perhaps this one is more related: http://www.geneontology.org/GO.tools.shtml#statistical

ADD REPLY
13
Entering edit mode
14.7 years ago
Dave Bridges ★ 1.4k

I prefer to use DAVID but I'd be interested to hear what other people like. There are also R packages available through Bioconductor (GOstats) that can do enrichment determination, but I am less familiar with those, and you need a good working knowledge of R to use these.

ADD COMMENT
8
Entering edit mode

The annotations underlying DAVID have not been updated in six years and DAVID should no longer be used. The paper "Impact of knowledge accumulation on pathway enrichment analysis" explores some of the problems in depth. I know of publications that were rejected in part because they used DAVID.

ADD REPLY
1
Entering edit mode

I also like DAVID.

If you're ruby users, I wrote a very basic library that allows you to query DAVID from your scripts: http://github.com/chrisamiller/davidapi

ADD REPLY
1
Entering edit mode

There is also a Bioconductor package DAVIDQuery (http://bioconductor.org/packages/release/bioc/html/DAVIDQuery.html) for accessing DAVID programmatically.

ADD REPLY
0
Entering edit mode

DAVID is the one I always recommend to our wet lab biologists too. It's now been maintained well for several years, unlike many bioinformatics resources..

ADD REPLY
9
Entering edit mode
14.7 years ago

The BiNGO plugin for Cytoscape will allow you to determine term enrichment in a Cytoscape network. It's quite a neat tool.

ADD COMMENT
0
Entering edit mode

Tried BiNGO to calculate enrichment of a set of nodes. Excellent tool. Thanks for the suggestion

ADD REPLY
0
Entering edit mode

I found BiNGO extremly handy while I was dealing with annotations of non-model organism. For future use I have documented it here.

It also allows us to save the text output, including stats, which particularly helped me in making custom barplots and pie charts of GO terms.

Nice tool !!

ADD REPLY
6
Entering edit mode
14.7 years ago
brentp 24k

it's very much in progress, but a colleague and I (mostly him) have been working on this for python: http://github.com/tanghaibao/goatools/

it has a command-line script to find terms that are enriched in a study group. it reports p-value for various multiple testing corrections as well as the false discovery rate.

It can also be used to plot the DAG of a particular GO term.

ADD COMMENT
5
Entering edit mode
14.7 years ago

I think most of the enrichment analysis tools deals with same class of statistics methods (p-value, FDR, Boneferroni etc). Defining background is a very important in such enrichment methods. To get real meaning of enrichment with respect to your experiments, you should be able to upload the background. For example, if you are looking at a set of a genes from a particular tissue, a background of that tissue give more meaningful results than a background of whole genome.

ADD COMMENT
1
Entering edit mode

Hi Khader Shameer,

Yeah I read a paper in which they did the Gene Ontology Term Enrichment analysis exactly like you said here. And I wanna analyse my data in that way (significantly differentially regulated transcripts Enrichment analysis against unchanged background transcripts). My data is RNQ-seq, so how could I make regulation level unchanged transcripts set as a background analyse Ontology Term Enrichment? What kind of approach or tool should I use there? Would u give me some suggestions?

ADD REPLY
3
Entering edit mode
14.7 years ago

You can try one of the tools at babelomics, in particular FatiGO; or as an alternative, you can use the same AmiGO Term Enrichment tool.

However, be careful when using GeneOntology: it is a very active and supported project, so they make big enhancements between two releases. If you look at thei bug tracker, there are at least 8-10 changes to geneontology terms every day. So, annotate the version and date of GeneOntology if you want your experiment to be reproducible.

ADD COMMENT
3
Entering edit mode
14.7 years ago
Allpowerde ★ 1.3k

Another option is GONOME (http://gonome.imb.uq.edu.au/), which finds the over- and under-represented GO terms for a given set of genomic positions.

ADD COMMENT
3
Entering edit mode
14.7 years ago

GOrilla makes nice pictures.

ADD COMMENT
3
Entering edit mode
14.6 years ago
Stew ★ 1.4k

David is a great place to start. I also like GeneTrail, which can do GSEA analysis as well as standard enrichment on a gene list. Out of the web world I really like ClueGO and to a lesser degree BInGO in Cytoscape.

In case they were ever useful I put up some slides from a short course I give on functional analysis using web tools and Cytoscape.

ADD COMMENT
3
Entering edit mode
13.9 years ago
Dejian ★ 1.3k

WEGO is a convenient tool to plot the enriched GO items. http://wego.genomics.org.cn/cgi-bin/wego/index.pl

ADD COMMENT
3
Entering edit mode
13.0 years ago
boczniak767 ▴ 870

WebStart version of Ontologizer will be perfect. If you provide association and OBO files to your wet-lab colleagues they easily could use this program.

ADD COMMENT
2
Entering edit mode

+1 for ontologizer. It's easy to download and use as part of a pipeline since it's just a java program.

ADD REPLY
2
Entering edit mode
14.7 years ago
Ian Simpson ▴ 960

Like several of the others I also recommend DAVID to wet lab biologists. It is well maintained, but you should check the version on the particular species annotation(s) they are currently using as it it sometimes not the latest.

They use a variant of the Fisher exact statistic for their p-value calculations called the EASE score which they wrote up in a paper a few years back http://www.ncbi.nlm.nih.gov/pubmed/14519205 which is more conservative that the standard.

ADD COMMENT
0
Entering edit mode

Just to echo other comments (note my original response was 6 years ago !). Don't use DAVID. We don't.

ADD REPLY
2
Entering edit mode
13.0 years ago
fransua ▴ 390

another one here:

http://www.blast2go.com/b2ghome

userfriendly and complete, with many tools regarding to visualization import/export and statistical tests. free version is sufficient for almost every work

ADD COMMENT
2
Entering edit mode
8.5 years ago
Surya Saha ▴ 260

Just wanted to add a note that DAVID has not been updated since 2010 so should not be used. See preprint for more detailed discussion.

ADD COMMENT
0
Entering edit mode
8.1 years ago
jin ▴ 80

For plants, you can try PlantRegMap, which integrated GO annotation for 132 plant species and perform enrichment analysis.

ADD COMMENT
0
Entering edit mode
4.7 years ago
tarashanst • 0

This is an extremely light-weight tool written in Python that can do what you need: LightGOEA

ADD COMMENT
0
Entering edit mode

Dear tarashanst,

It looks good. But the point of GO enrichment tools is to reduce the workload from the user's side. If the user has to prepare their own GO term dictionary, what exactly LightGOEA does? Looks like it only matches the user-provided gene list with the user-provided GO terms with associated gene list. LightGOEA's only function is the calculate overlap statistics.

it would be convenient if the user just has to provide the gene list to the tool and get the work done. For example, have a look at the GeneSCF enrichment tool.

ADD REPLY

Login before adding your answer.

Traffic: 1827 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6