GWAS on traits non-normally distributed
1
0
Entering edit mode
3 months ago
AndrMod • 0

I'm about to run several GWAS on a set of traits measured on ~160 individuals. I want to use FarmCPU which, as I understand, implements a MLM with pseudo-QTN as coviariates (fixed model); hence it assumes normality of the trait. While analysing the traits I noticed that none of them has a normal distribution. The prevalent shapes are these: enter image description here

enter image description here

enter image description here

enter image description here

I also assessed the non-normality with a Shapiro-WIlk test. Some of these traits are residuals of a lm(trait ~ covariate), but not further quantile-normalised. Other are parameter of a model predicting the trait values. I'm not very convinced I should transform them more.

I'm aware that most of the trait don't follow a normal distribution; nonetheless, this is concerningme a lot, as -without a reference distribution- I also can't pick an approach to detect phenotypic outliers.

The first histogram can roughly look like a Poisson but "roughly doesn't make it a Poisson proper.

Someone can shed a light on how to deal with them?

normality GWAS phenotypes • 380 views
ADD COMMENT
0
Entering edit mode
12 weeks ago

Hi,

Please be aware that, for a statistical model, it is not the actual values that need to follow a normal distribution; rather, it is the residuals after you have fit the model.

Log-transformation, scaling between certain values (e.g. 0-1), encoding as tertiles or quartiles, et cetera - these are all valid approaches.

Kevin.

ADD COMMENT

Login before adding your answer.

Traffic: 1659 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6