Hi everyone,
In statistics and in science in general, it is always harder to convincingly show lack of effect rather than significant differences. In low throughput experiments, one can always report pvalues and show that the difference is not statistically significant but the situation is more complex (at least to me) in genome-wide studies. For instance, in the case of RNA-seq expression data, even if there is no biological differences between two conditions, there will always be some genes significantly differentially expressed because thousands of genes are tested. In such a case, one can not just say "we didn't see any significant differences between condition X and Y".
How would you illustrate and discuss such a case ? Do you have any example of publications that address this issue ?
Here are some ideas :
- Discuss that there is less DEG between the conditions X and Y than between X and Z (where there is an effect that has been biologically confirmed). However I find this a bit weak.
- Discuss that there is obviously no global differences between X and Y beside some differences that might be anecdotical.
- Show MA-plot/volcano plot and let the reader decide for himself.
Best,
Carlo
A possible (but maybe also not strongly convincing) way would be to perform GO/KEGG enrichment and conclude that there are "no meaningful" differentially expressed genes.
Thank you for your input. Yes, that could be an interesting point in some cases. However sometimes the genes are deregulated based on their "genomic features" (position on chromosomes, presence of introns, nearby ncRNAs, ...) rather than their biological function and there is no functional link (GO/KEGG) between the DEG, even if there is a true effect.
Use a more stringent multiple-testing adjustment so no tests are significant after adjustment. Just kidding, don't do that. :)
Yep, I thought the same thing ^^
More seriously, let's assume that we can't change the stringency because we want to keep the same parameters across the full study (that includes more conditions than X and Y).
If there is no difference in X & Y (and if that has no bearing/influence on the conclusion(s) of the study) why not report the fact as is?
Perhaps you could compare condition X vs X (e.g., use 6 biological replicates for 3 vs 3 comparison) to demonstrate that variation plus a large number of genes invariably identifies some genes with differential expression. Or you could compare mixed (XY vs XY) or even randomly sampled data to make the same point.
If you start off with the null hypothesis "I will not detect a difference in mRNA expression between these two animals", then multiple-testing adjustments is probably the least of your worries.
replicate using orthogonal assays, like qPCR for RNA-Seq. There are much larger issues with RNA-Seq than just this.