How to perform gene set enrichment analysis on genes that contribute to a specific PC?
3.0 years ago
Aspire ▴ 370

In my data, the samples separate along one specific PC in a biologically interesting way: samples that received a high amount of a compound fall on one side of the PC, samples that received a low amount fall on the other side, and samples that received a moderate amount sit in the middle.

I would like to know which genes are responsible for this separation and, more specifically, which biological functions are enriched among those genes.

So I thought of running gene set enrichment analysis on the gene loadings of that specific PC. Genes that contribute strongly to the PC have a loading that is large in absolute value, whether positive or negative.
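To make the idea concrete, here is a minimal sketch in base R of how the loadings of one PC could be turned into a ranked gene list. Everything in it is an assumption for illustration: the matrix is simulated rather than real rlog output, the dose effect and the gene set `gene_set` are hypothetical, and the Wilcoxon rank-sum test is only a simple stand-in for a proper preranked enrichment method. In practice the named vector `ranked` is the kind of input a preranked GSEA tool would take.

```r
set.seed(1)

# Simulated stand-in for an rlog-transformed matrix (genes x samples).
# All values here are hypothetical, not real expression data.
n_genes <- 200
n_samples <- 12
mat <- matrix(rnorm(n_genes * n_samples), nrow = n_genes,
              dimnames = list(paste0("gene", seq_len(n_genes)),
                              paste0("sample", seq_len(n_samples))))

# Hypothetical dose gradient (low, moderate, high) acting on the first 20 genes.
dose <- rep(c(0, 1, 2), each = 4)
mat[1:20, ] <- mat[1:20, ] + outer(rep(2, 20), dose)

# prcomp() expects samples in rows, so transpose the genes x samples matrix.
pca <- prcomp(t(mat), center = TRUE, scale. = FALSE)

# Loadings of the PC of interest (PC1 in this toy example).
pc <- 1
loadings <- pca$rotation[, pc]

# Ranked gene list: a named numeric vector sorted from most positive
# to most negative loading.
ranked <- sort(loadings, decreasing = TRUE)

# Simple stand-in for an enrichment test: do genes in a hypothetical set
# have larger absolute loadings than the remaining genes?
gene_set <- paste0("gene", 1:20)
in_set <- names(loadings) %in% gene_set
wt <- wilcox.test(abs(loadings[in_set]), abs(loadings[!in_set]),
                  alternative = "greater")
print(head(ranked))
print(wt$p.value)
```

Note that the sign of a PC is arbitrary, so which end of the ranked list corresponds to "high dose" has to be checked against the sample scores in `pca$x`.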

My question is whether the genes must be standardized (converted to Z-scores) before the PCA. Usually, before PCA I apply DESeq2's rlog function but do not standardize, i.e. I call prcomp(scale. = FALSE). The separation is more pronounced when the data are not standardized.
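For reference, the two options can be compared side by side. The sketch below again uses simulated data (the gene-specific standard deviations in `gene_sd` and the dose effect are hypothetical). It only illustrates what the `scale.` argument does: with `scale. = FALSE` the PCA works on the covariance matrix, so genes with larger variance can receive larger loadings, while with `scale. = TRUE` it works on Z-scored genes, so every gene enters with unit variance. It does not settle which choice suits a given dataset.

```r
set.seed(2)

# Simulated genes x samples matrix with gene-specific noise levels
# (hypothetical values, standing in for rlog output).
n_genes <- 200
n_samples <- 12
gene_sd <- runif(n_genes, min = 0.5, max = 2)
mat <- matrix(rnorm(n_genes * n_samples, sd = gene_sd), nrow = n_genes,
              dimnames = list(paste0("gene", seq_len(n_genes)),
                              paste0("sample", seq_len(n_samples))))

# Hypothetical dose effect on the first 20 genes.
dose <- rep(c(0, 1, 2), each = 4)
mat[1:20, ] <- mat[1:20, ] + outer(rep(3, 20), dose)

x <- t(mat)  # samples in rows, as prcomp() expects

# PCA without and with per-gene standardization.
pca_raw <- prcomp(x, center = TRUE, scale. = FALSE)
pca_z   <- prcomp(x, center = TRUE, scale. = TRUE)

# scale. = TRUE is equivalent to Z-scoring each gene beforehand.
pca_manual <- prcomp(scale(x), center = FALSE, scale. = FALSE)

load_raw <- pca_raw$rotation[, 1]
load_z   <- pca_z$rotation[, 1]

# After standardization every gene has variance 1, so the total
# variance across all PCs equals the number of genes.
total_var_z <- sum(pca_z$sdev^2)

# How similar are the two gene rankings? (absolute loadings, since
# the sign of a PC is arbitrary)
rank_cor <- cor(abs(load_raw), abs(load_z), method = "spearman")
print(rank_cor)
```

If the two rankings differ substantially, the enrichment results will depend on the scaling choice, so it may be worth running the enrichment on both and comparing.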

pca • 516 views