Hello,
I have a simple RNA-seq experiment with treatment and control, each with 3 biological repeats. I run my data through edgeR and obtained differentially expressed genes (DEGs). Due to the low sample number and small effect size, there are likely more genes affected by the treatment that didn't meet the cutoff. For that reason, I want to try extracting more information from my data using a permutation test. I resampled my data by shuffling the columns and generated 1000 permuted dataframes. Next, I run each dataframe through my edgeR pipeline, which produced results such as RowName, logFC, logCPM, LR, and PValue. My qustion is, Which value (for example, logFC or PValue) do I take from each permuted dataframe to generate the distribution for each gene and calculate the p-value? Also, what is the p-value calculation?
Thank you!
Cross posted on https://stats.stackexchange.com/questions/644546/permutation-test-in-edger (with a comment of mine). In the future, please state that you are posting on multiple sites and add links so people can check whether an answer exists.
Posted to at least four forums.
For the record, I don't think that permutation is a useful approach in this context. I made a fairly detailed answer on Bioconductor: https://support.bioconductor.org/p/9157546/#9157691