Question

How to adjust p value after Wilcoxon rank sum test?

0

Entering edit mode

3.4 years ago

TheCatalyst • 0

Can you please help me figure out how the adjusted p-values were obtained in the attached figure. Did they perform Wilcoxon rank sum test and then divided the resultant pvalue with the total number of genes tested?

Fold change of genes located TADs

The figure legend: Expression fold change between cancer (C42B) and normal (RWPE1) cells of genes that are located in large size normal-specific TADs (red, see the symbol ❖ to the right side of graph in c) and not large size normal-specific TADs (blue, solid diamond ♦) (Wilcoxon rank sum test, * adj. p value < 0.05, ** adj. p value < 0.01, *** adj. p value < 0.001)

p-value TADs statistics Hi-C • 2.3k views

ADD COMMENT • link updated 3.4 years ago by Michael 55k • written 3.4 years ago by TheCatalyst • 0

0

Entering edit mode

One cannot deduce the exact method from that partial plot without context (what is a TAD?), I suppose reading the original article and focus on the methods section. Also, the *** seem to indicate highly significant difference, however when looking at the box-plot (which in itself is not ideal) I don't really buy that is something else than an artifact, but it might be due to large number of data points in wilcox test.

ADD REPLY • link 3.4 years ago by Michael 55k

0

Entering edit mode

Yes, there are hundreds of genes (data points) covered by TADs (Topologically Associated Domains). Ref. Rhie et al., 2019, Nat Commun. (link to Methods section) The methods section doesn't mention the stats analysis.

ADD REPLY • link updated 3.4 years ago by Michael 55k • written 3.4 years ago by TheCatalyst • 0

0

Entering edit mode

It does mention it in a way:

Wilcoxon rank sum tests were performed to compare expression levels of genes in TADs and looped to enhancers. Raw p values are adjusted for multiple comparisons using the Benjamini and Hochberg method.

And in the Results section referencing Fig.1d:

By comparing TAD sizes between normal and cancer cells genome-wide (adj. p value < 0.05, Wilcoxon rank sum test), we identified ~520 large size TADs in normal cells that correspond to ~850 smaller TADs in cancer cells. Interestingly, we found that in these altered TADs, relatively more genes showed increased expression in cancer cells than in normal cells (p value < 8.93e-09, Wilcoxon rank-sum test) (Fig. 1d).

However, this in context, mentioning only a p-value (not adjusted p-value), together with a single comparison of two box-plots shown in Fig. 1d confirms my interpretation that this is just a single Wilcox test carried out. Thus, the mentioning of adjusted p-value is redundant for this part of the figure, but not wrong, for a single test p-value = adj. p-value. It might serve a purpose for other parts of the panel figure where multiple testing does occur.

ADD REPLY • link 3.4 years ago by Michael 55k

0

Entering edit mode

Please supply the reference (why supply a figure without a full reference? ;) FWIW, TAD = Topologically Associated Domain...chunks of DNA with physical proximity from HiC experiments.