Hi all,
I have generated a database of normalized counts for transcripts produced from an RNA sequencing experiment. I have an a priori interest in only 5 of them and was wondering whether it is legitimate to simply perform tests of significance for those 5 without considering the rest, and thus get away without FDR-adjusting the p-values. Is that ever acceptable?
Thanks!
Thanks for the reply! The initial goal was to take the non-hypothesis driven path but then my boss, based on some other experiments, decided to focus her research on a few genes only.
If you have enough samples, I'd suggest do correlation analysis and then create a network. Then identify which clusters contain those 5 genes and then examine those clusters. This way you can show something to your boss and backed by experimental data you can generate some hypothesis from it as well.
Also, as German suggested you don't really need to do FDR adjustment. It just takes care of avoiding false positives when doing multiple comparisons. If you take a really low pvalue threshold (not 0.05), FDR can be avoided.