Entering edit mode
6.5 years ago
Björn
▴
110
How to create PCA plot and heatmaps using only significant genes in edgeR ?
I have 10 genes at signficant different levels between comparison groups. How can I create PCA and heatmaps with only those 10 genes.
Suggestions on R code will be appreciated !
Actually, this is not appropriate. If you are taking genes which you a priori know that are different between your samples and use those for a PCA then you will of course get a separation between your experimental groups. That's a circular analysis or "double dipping".
have a look at varistran R package, specifically at
Hey Björn, Have you not searched for how to do this? There is extensive material across the World Wide Web (and on Biostars) about how to do this. Kevin
Thanks Kevin, I used standard edgeR code for PCA but wanted to create PCA for 50 DE genes out of 2000. Use only 50 genes out of 2000 in PCA plot.
Okay. You could just filter your logCPM data to include only your genes of interest, and then regenerate the PCA bi-plot. I have also put code here: A: PCA plot from read count matrix from RNA-Seq
wouldn't it be more or at least equally interesting to see where the data is in the PCA on all data?
A good point, Carambakaracho