Question

Why statistic test should I use?

0

Entering edit mode

17 months ago

Matheus • 0

I"m comparing two sets( a and b) of genomes in some metrics(genome size, gc content, number of gene). A is a set of about 130 genomes, and B(30 genomes) is a subset inside A that have some characteristics that I want. I have calculated the average of those metrics I both as sets and that I want to know if those averages are significantly different. Which test should I use? I am thinking about one sample t test, but my data is not normally distributed.

Statistics • 959 views

ADD COMMENT • link updated 17 months ago by dariober 15k • written 17 months ago by Matheus • 0

1

Entering edit mode

Without seeing the data, why not just give the Wilcox test a try? It makes no distributional assumptions and 130 vs 30 is big enough of an n to get significances with it. It tests ranks, not means though.

ADD REPLY • link 17 months ago by ATpoint 88k

0

Entering edit mode

B(30 genomes) is a subset inside A

Regardless of the test you choose, shouldn't you compare B vs (set of A genomes without B genomes)? I can't really tell why, but comparing a subset vs the whole seems odd to me.

ADD REPLY • link 17 months ago by dariober 15k

score 0 · Answer 1 · 2024-02-07

0

Entering edit mode

17 months ago

shelkmike ★ 1.6k

I think you can do a Mann-Whitney test. Strictly speaking, it does not compare means (the null hypothesis is slightly different from the equality of means), but it probably is still suitable for you.

ADD COMMENT • link 17 months ago by shelkmike ★ 1.6k