9.2 years ago
tiago211287
I have started analysing GO enrichment for my gene list using the R package clusterProfiler, which gets its data from DAVID. This raised a question: which set of genes should I use as the universe (background) when testing for enrichment?
All annotated genes?
Only annotated genes expressed in the tissue I am studying? (Mine is mouse heart.)
Only genes present in my GTF file?
Thank you
Enrichment analysis is based on comparing two lists, and what these two lists should be depends on the question being asked. Which question are you interested in?
1 - Does my list contain a higher proportion of genes of type X than the set of genes expressed in the mouse heart?
2 - Does my list contain a higher proportion of genes of type X than the set of genes found in my GTF file?
Depending on the context (i.e. what the list and the GTF file represent), both are valid questions.
Typically, the background gene list consists of all genes tested in the experiment.
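To see why the choice of background matters, here is a minimal sketch of the one-sided hypergeometric test that over-representation analysis is usually based on. It is written in plain Python rather than with clusterProfiler, and every number in it is hypothetical (a 200-gene list with 20 genes annotated to term X, scored against two made-up backgrounds); the function name `enrichment_pvalue` is also just an illustration, not part of any package.

```python
from math import comb


def enrichment_pvalue(universe_size, annotated_in_universe,
                      list_size, annotated_in_list):
    """One-sided hypergeometric p-value: the probability of drawing at
    least `annotated_in_list` annotated genes when `list_size` genes are
    sampled without replacement from the universe."""
    total = comb(universe_size, list_size)
    upper = min(annotated_in_universe, list_size)
    tail = sum(
        comb(annotated_in_universe, k)
        * comb(universe_size - annotated_in_universe, list_size - k)
        for k in range(annotated_in_list, upper + 1)
    )
    return tail / total


# Hypothetical gene list: 200 genes, 20 of them annotated to term X.
list_size, hits = 200, 20

# Hypothetical background 1: all annotated genes (20000 genes, 500 in term X).
p_all = enrichment_pvalue(20000, 500, list_size, hits)

# Hypothetical background 2: genes expressed in the tissue
# (12000 genes, 450 in term X).
p_expressed = enrichment_pvalue(12000, 450, list_size, hits)

print(f"all annotated genes as background: p = {p_all:.3g}")
print(f"expressed genes as background:     p = {p_expressed:.3g}")
```

With these made-up counts, term X is more common among the expressed genes than among all annotated genes, so the same 20 hits look less surprising against the expressed-gene background. The same list can therefore be "enriched" to different degrees depending on which of the two questions above is being asked.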