Principle Component Analysis (PCA) for GWAS (SNP genotype data)
2
0
Entering edit mode
10.0 years ago
samorjoy ▴ 10

I'm doing a Genome Wide Association Study (GWAS) in R. I have SNP genotype data for 300 individuals. I have a total of 177,000 SNPs.

Before diving into the GWAS, I want to adjust for population stratification by doing a PCA analysis. I need to do a PCA that identifies the say top 10 principal components (PCs) and use them as covariates in the association analysis. Do you know of any R packages or software that will enable me to do this seamlessly?

I know about EIGENSTRAT that is implemented as part of the EIGENSOFT software. Has anyone used it before? I downloaded the software last night. Is there a tutorial on how to load/import the 177,000 SNPs into the software and do the PCA analysis. Do the SNPs need to be in a special format? Once I have the data, is it possible to import it into R? Do I need to write a wrapper to call the software? Any tips or information will be helpful. Thanks

SNP GWAS PCA R • 8.0k views
ADD COMMENT
1
Entering edit mode
10.0 years ago
always_learning ★ 1.1k
I have used SNPRELATE for one of such analysis few months back. You can try that once.
ADD COMMENT
0
Entering edit mode

good suggestion. thanks, I'll take a look at the manual for it and get back to you if I get stuck

ADD REPLY
0
Entering edit mode
4.2 years ago

Hi,

Did you find the answer? I think it's as simple as providing the n.PC = 10 parameter in your sommer::GWAS function. It does everything automatically.

Cheers, Ricardo

ADD COMMENT

Login before adding your answer.

Traffic: 1700 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6