I would like to perform a statistical test on a 3 way venn diagram. I have 3 RNA Seq data sets that I wish to examine their overlap. I have generated a 3-way venn diagram for these data using the software called "VennPlex" which makes a Venn diagram stratified by up- and down-regulation (i.e. there are 2 numerical values in exclusive and overlapping sectors of the diagram, a value for upregulated genes, for downregulated genes, and in overlaping sectors a third value counter regulation of a shared gene between the two overlapping tables).
My research has lead me to think that a hypergeometric test is the best approach to perform statistics on a venn diagram. The R package "GeneOverlap" is built to do just that, but it can only perform such statistics on the overlap between two tables. That means, I can perform multiple tests on pairwise overlap between each of the 3 tables, but this will only test the hypothesis on the intersect between two tables at a time, there is no sufficient way to test the 3 way intersect sector of the diagram.
My questions are:
- Is the hypergeometric test the appropriate test for assessing overlap in a venn diagram 1a. If yes, what is the appropriate background value to use for this test? Is it the size of the whole mouse genome, or only the total number of significant genes identified for each comparison?
- If I am on the correct track how would I perform a test appropriately on all 4 possible overlapping sectors?
- After testing all these hypothesis do I need to also perform a p-value adjustment for multiple hypothesis testing?