Mann-Whitney Test/Wilcoxon but for individual genes between case and control
1
0
Entering edit mode
4.1 years ago
VBer ▴ 200

Hello,

I am trying to replicate the DEG as performed in a RNA-Seq study. The steps below are exactly what the authors have mentioned in their methods section.

I have 8 cases and 20 controls (clinical data). I have taken the mean normalized expression for each gene in the control, and divided the normalized counts in the cases individually by sample using control mean to get the fold change.

Now I want to know if the fold change is significant or not. The authors have seemed to have performed a Mann-Whitney test. However, I am not sure how the authors went about this. Their results have individual p-values for each gene. I am not sure how to do that as wilcox.test gives an overall p-value (which is significant actually) when I use the mean case vs mean control normalized counts.

RNA-Seq Statistics • 2.4k views
ADD COMMENT
0
Entering edit mode

The best way to approach this would be to email the authors.

ADD REPLY
0
Entering edit mode

They aren't very responsive :'( I had to email them several times just for their raw counts data :'(

ADD REPLY
0
Entering edit mode

How recent is the paper? Unfortunately, if authors are not open to others reproducing their results, you will face such barriers. Keep at it, maybe future replies will be faster. If not, see if you can switch papers without compromising on your goal.

ADD REPLY
0
Entering edit mode

This year in June!

And thank you! I don't feel as disheartened as before :)

ADD REPLY
1
Entering edit mode
4.1 years ago
caggtaagtat ★ 1.9k

Could it be, that the authors just performed a Mann-Whitney test on the normalized expression values per gene? Its basically similar to a t-test, where you have two population of values. In this case one population would be the normalized expression values of the 20 control cases and the second population would be the normalized expressen values of the 8 cases, per gene.

ADD COMMENT
1
Entering edit mode

Hi, just an update, I just used DESeq2 because I could not very well replicate or use the same functions as the authors. The number of DEGs reported are drastically lesser... But that's fine by me.

Thanks!

ADD REPLY
0
Entering edit mode

Please do not use the Close Question function to resolve your question. Instead, add an answer and accept it.

ADD REPLY
0
Entering edit mode

Yes, I think so too. And looks like they used rankSumTestWithCorrelation from limma.

I will be following this example from Gordon Smyth. I am not sure how to know if I am right or wrong though. Maybe compare with the actual data they have shared. I'll update on how it goes.

ADD REPLY
0
Entering edit mode

I just realized this function is one-tailed, and gives left and right p-values.

I guess, I'll just use wilcox.test in a loop on the mean normalized case and control.

ADD REPLY

Login before adding your answer.

Traffic: 1780 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6