Deriving p values for gene correlations by permutation analysis

9.5 years ago

chris86 ▴ 400

Hi

I am calculating, for every gene, the correlation between its expression across all patients and clinical parameters. I then want to derive p-values by permutation. Is it OK to randomly permute the sample columns and recalculate the correlation for every gene N times, and then ask how often a permuted correlation is equal to or greater than that gene's observed correlation?

E.g. if my observed correlation is 0.3, then the p-value would be (number of permuted correlations >= 0.3) / (total number of permutations).

So I have one distribution per gene?

Thanks,

Robert

RNA-Seq next-gen • 2.9k views

updated 3.1 years ago by Ram 45k • written 9.5 years ago by chris86 ▴ 400

It sounds about right to me. Have a look also at the boot package in R (and this FAQ http://www.ats.ucla.edu/stat/r/faq/boot.htm). Once you have the p-values, I guess you should correct them for the multiple tests you performed (i.e. one test per gene).
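For illustration, here is a minimal sketch in plain Python of the procedure described in the question, followed by a multiple-testing correction. Several things here are assumptions rather than anything stated in the thread: the function names (`pearson`, `permutation_pvalues`, `benjamini_hochberg`) and the dict-of-lists input layout are made up for this example, Pearson correlation is assumed, the test is one-sided (for negative correlations you would probably compare absolute values instead), the p-value uses a +1 in numerator and denominator so it can never be exactly 0 (this differs slightly from the plain count/total ratio in the question), and Benjamini-Hochberg is just one common choice of correction.

```python
import random


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences.

    Assumes neither sequence is constant (otherwise this divides by zero).
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def permutation_pvalues(expr, clinical, n_perm=1000, seed=0):
    """One-sided permutation p-value per gene (hypothetical helper).

    expr:     dict mapping gene name -> list of expression values, one per patient
    clinical: list of clinical values, in the same patient order
    """
    rng = random.Random(seed)
    observed = {gene: pearson(values, clinical) for gene, values in expr.items()}
    exceed = {gene: 0 for gene in expr}
    shuffled = list(clinical)
    for _ in range(n_perm):
        # Shuffling the clinical values is equivalent to permuting the sample
        # columns; the same permutation is reused for every gene.
        rng.shuffle(shuffled)
        for gene, values in expr.items():
            if pearson(values, shuffled) >= observed[gene]:
                exceed[gene] += 1
    # Assumption: +1 correction so that a p-value is never reported as 0.
    return {gene: (exceed[gene] + 1) / (n_perm + 1) for gene in expr}


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted p-values (dict in, dict out)."""
    # Walk from the largest p-value down, keeping a running minimum so the
    # adjusted values stay monotone in the raw p-values.
    items = sorted(pvals.items(), key=lambda kv: kv[1], reverse=True)
    m = len(items)
    adjusted = {}
    running_min = 1.0
    for i, (gene, p) in enumerate(items):
        rank = m - i  # rank of this p-value in ascending order
        running_min = min(running_min, p * m / rank)
        adjusted[gene] = running_min
    return adjusted
```

So yes, this gives one null distribution per gene: each gene's observed correlation is compared against its own set of `n_perm` permuted correlations. You would call `permutation_pvalues(expr, clinical)` once per clinical parameter and then pass the resulting dict to `benjamini_hochberg`. Note that the smallest p-value you can get is 1 / (n_perm + 1), so with many genes N needs to be large enough for anything to survive the correction.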

updated 3.1 years ago by Ram 45k • written 9.5 years ago by dariober 15k