To compare a set of genes is different from all the genes, which test should be used.
1
0
Entering edit mode
9.5 years ago
Zhilong Jia ★ 2.2k

Here there are two list of genes. One is a set of genes (about 500 genes), the other is all the genes (about 20k). And each gene has a number meaning the number of genes interacted with it. To compare whether or not they are different concerning the number of interacted genes, which kind of test should be used? Thank you.

A = c(4, 5, 40, 3, 0, 0) #length(A)=500
B = c(1,2,0,0,0,43,65,...) #length(B)=20000

Genes from A are also from B.

statistics • 1.5k views
ADD COMMENT
2
Entering edit mode
9.5 years ago
russhh 5.7k

All normal approximations are out-of-bounds in this setting. I think your best bet is Kruskal-Wallis.

However, I'd temper your enthusiasm for the very low p-values it might return, as there are some serious acquisition biases in interactome data (and KW provides no way to control for them)

A <- rpois(50, 10)
B <- rpois(50, 100)
kruskal.test(list(A, B))
ADD COMMENT
0
Entering edit mode

Yes, there are biases in interactome data, esp. lots of 0s. Any other test which could be used? Thank you.

ADD REPLY
0
Entering edit mode

Did it not work?

ADD REPLY
1
Entering edit mode

It should work.

ADD REPLY
0
Entering edit mode

Yes, it works. Thank you.

ADD REPLY

Login before adding your answer.

Traffic: 2004 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6