Determining the subjects to use after obtaining a PC plot of population substructure
7.4 years ago
Sheila ▴ 460

I am doing QC for a GWAS analysis. I used PC-AiR and PC-Relate (two Bioconductor packages) to determine the relatedness and population substructure of my dataset. I compared it to 1000 Genomes data and have a plot of the first two PCs from my PCA. In general, what is the best practice for excluding subjects from a study after visually inspecting the PC plot? Is there a specific method (i.e., an R package) that is considered best practice? Or do I decide arbitrarily, based on the graph, which set of subjects to include?

Thanks for your thoughts, in advance.

population stratification best practices GWAS • 1.5k views

You will find a very detailed answer here: https://stats.stackexchange.com/questions/8777/in-genome-wide-association-studies-what-are-principal-components Also, I suggest you take a look at the GenABEL manual (http://www.genabel.org/sites/default/files/pdfs/GenABEL-tutorial.pdf). In section 5.3 they describe the method used for outlier detection.
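
To make the idea of a non-arbitrary cutoff concrete, here is a minimal Python sketch of one common heuristic: keep study samples whose PC coordinates fall within a fixed number of standard deviations of a reference cluster (for example, the 1000 Genomes samples of the ancestry you want to retain). This is not the GenABEL procedure and not part of PC-AiR; the function name `flag_pc_outliers`, the threshold `n_sd=6.0`, and the toy coordinates are all assumptions made for illustration.

```python
from statistics import mean, stdev


def flag_pc_outliers(study_pcs, reference_pcs, n_sd=6.0):
    """Return IDs of study samples that lie more than n_sd reference
    standard deviations from the reference mean on any PC.

    study_pcs / reference_pcs: dict mapping sample ID -> list of PC values.
    n_sd: assumed threshold (hypothetical default, tune for your data).
    """
    n_pcs = len(next(iter(reference_pcs.values())))
    # Centre and spread of the reference cluster, computed per PC.
    centers = [mean(v[k] for v in reference_pcs.values()) for k in range(n_pcs)]
    spreads = [stdev(v[k] for v in reference_pcs.values()) for k in range(n_pcs)]

    outliers = []
    for sample_id, coords in study_pcs.items():
        # A sample is flagged if it is too far from the centre on any PC.
        if any(abs(coords[k] - centers[k]) > n_sd * spreads[k] for k in range(n_pcs)):
            outliers.append(sample_id)
    return outliers


# Toy data (made up): PC1 and PC2 for a reference cluster and three study samples.
reference_pcs = {
    "REF1": [0.010, -0.002],
    "REF2": [0.012, 0.001],
    "REF3": [0.008, 0.000],
    "REF4": [0.011, -0.001],
    "REF5": [0.009, 0.002],
}
study_pcs = {
    "S1": [0.011, 0.001],
    "S2": [0.045, -0.030],
    "S3": [0.009, 0.004],
}

print(flag_pc_outliers(study_pcs, reference_pcs))
```

Writing the rule down this way keeps the exclusion reproducible: the threshold and the reference group are stated explicitly instead of being read off the plot, and the PC plot then serves as a visual check on the result.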

