Question

Gene Ontology term enrichment tools that count duplicate/redundant genes

0

Entering edit mode

7.7 years ago

memory_donk ▴ 380

Hi Biostars,

I've got a list of putative enhancer elements that I've predicted based on conservation, epigenetic marks etc etc and a subset of these which are predicted to be rapidly evolving. I would like to do a Gene Ontology analysis on nearby genes. However, most GO tools take your input list of genes and reduce it down to a non-redundant set. This works for many types of datasets but not really for mine. I could in principle have multiple enhancer elements with the same closest genes. I don't think it would make much sense to consider (for example) 2 accelerated elements near Snx10 the same as 7 non-accelerated elements near Snx10. Allowing duplicate genes to be counted more than once in my GO analysis ought to more accurately represent the set of genes which enhancers in either accelerated or non-accelerated groups may be interacting with. Does anyone know of a tool that lets you do this?

Thanks!

GeneOntology Ontology GO Genomics • 2.7k views

ADD COMMENT • link updated 7.7 years ago by Carlo Yague 8.9k • written 7.7 years ago by memory_donk ▴ 380

score 1 · Answer 1 · 2017-02-28

1

Entering edit mode

7.7 years ago

Carlo Yague 8.9k

You could rank your genes based on the number of associated enhancer elements and feed the ranked list into GOrilla.

ADD COMMENT • link 7.7 years ago by Carlo Yague 8.9k

0

Entering edit mode

That's a really interesting idea Carlo, I hadn't thought to rank GOrilla inputs like that.

Edit: Thinking about this though, even ranking still doesn't quite capture the frequency of GO terms associated with nearby genes. In principle, each GO term ought to be counted once for every occurrence of a nearby gene to accurately reflect the dataset.

ADD REPLY • link 7.7 years ago by memory_donk ▴ 380