I"m comparing two sets( a and b) of genomes in some metrics(genome size, gc content, number of gene). A is a set of about 130 genomes, and B(30 genomes) is a subset inside A that have some characteristics that I want. I have calculated the average of those metrics I both as sets and that I want to know if those averages are significantly different. Which test should I use? I am thinking about one sample t test, but my data is not normally distributed.
Without seeing the data, why not just give the Wilcox test a try? It makes no distributional assumptions and 130 vs 30 is big enough of an n to get significances with it. It tests ranks, not means though.
Regardless of the test you choose, shouldn't you compare B vs (set of A genomes without B genomes)? I can't really tell why, but comparing a subset vs the whole seems odd to me.