Plink Logistic regression controlling for pop. structure in tri-hybrid admixed population
1
0
Entering edit mode
6.9 years ago
Guilherme ▴ 40

Hello,

I have done an analysis in STRUCTURE to get individual ancestry estimation in my population sample. My population is a tri-hybrid population with ancestry of European, African and Amerindian. My question is:

Since It's a tri-hybrid admixed population, do I have to include in the logistic regression model the three columns (EUR, AFR, AMR), as covariates in order to control for possible confounding effects from population stratification?

I'm asking this because I have seen people using only two (for example, EUR and AMR, or AFR and AMR)...

Thanks,

plink Logistic regression admixed • 2.3k views
ADD COMMENT
1
Entering edit mode

Are you interested to do single-marker analyses (GWAS)? Feed data to gemma, it's going to create kinship matrix. Proceed with kinship matrix, your phenotype, covariates. Kinship takes care of population stratification. Gemma is good for admixed population.

ADD REPLY
2
Entering edit mode
6.9 years ago

Ciao Guilherme,

I am not so sure about the choice of STRUCTURE. It's reliability has been refuted: The computer program STRUCTURE does not reliably identify the main genetic clusters within species: simulations and implications for human population structure.

If your population has mixed ethnicity, then population structure, i.e., ethnicity, will most likely be a confounder for which you should adjust in your logistic regression model. You can visually check the extent of the confounding (and statistically check via various metrics) through PCA: Produce PCA bi-plot for 1000 Genomes Phase III in VCF format (old)

The standard way to adjust for population structure is to include the sample loadings from PC1 and PC2 as covariates (or more PCs, if you feel necessary). PCA can easily be performed within PLINK or using other programs, such as EIGENSOFT or GCTA (the actual PCA implementation in PLINK is the same as GCTA).

There is a previous Biostars thread relating to a similar topic: GWAS: when is it appropriate to add covariates?

Abraços,

Kevin

ADD COMMENT
1
Entering edit mode

Thank you so much for the enlightenment Kevin! I'm going to take a look at PCA.

ADD REPLY
1
Entering edit mode

De nada cara

ADD REPLY

Login before adding your answer.

Traffic: 2016 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6