Entering edit mode
6.2 years ago
The Last Word
▴
230
This is more of a statistical question. I have 8 gene clusters and a list of gene names. All the clusters are of varying lengths. I have a list of 200 gene names and I found that 60 out genes out of the list fall in the third cluster. How do I statistically check if a significant number of genes from my list fall in the third cluster.
How do I calculate a p-value for the hypothesis that a significant number of genes from my list fall in a specific cluster. The size of each cluster is 431,385,361,291,281,234,201 and 197 genes. So cluster three has 361 genes of which 60 genes are present in my list.
Looks like a gene set enrichment test, which is commonly done using a hypergeometric test as far as I know.