Why statistic test should I use?
1
0
Entering edit mode
9 months ago
Matheus • 0

I"m comparing two sets( a and b) of genomes in some metrics(genome size, gc content, number of gene). A is a set of about 130 genomes, and B(30 genomes) is a subset inside A that have some characteristics that I want. I have calculated the average of those metrics I both as sets and that I want to know if those averages are significantly different. Which test should I use? I am thinking about one sample t test, but my data is not normally distributed.

Statistics • 632 views
ADD COMMENT
1
Entering edit mode

Without seeing the data, why not just give the Wilcox test a try? It makes no distributional assumptions and 130 vs 30 is big enough of an n to get significances with it. It tests ranks, not means though.

ADD REPLY
0
Entering edit mode

B(30 genomes) is a subset inside A

Regardless of the test you choose, shouldn't you compare B vs (set of A genomes without B genomes)? I can't really tell why, but comparing a subset vs the whole seems odd to me.

ADD REPLY
0
Entering edit mode
9 months ago
shelkmike ★ 1.4k

I think you can do a Mann-Whitney test. Strictly speaking, it does not compare means (the null hypothesis is slightly different from the equality of means), but it probably is still suitable for you.

ADD COMMENT

Login before adding your answer.

Traffic: 2119 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6