I want to know why the correlation of PRS obtained by plink and PRSice is low.
Firstly, I used plink --clump. The parameters are same as PRSice process.
plink --bfile HCP --clump gwas --clump-p1 0.00145 --clump-r2 0.1 --clump-kb 250
Then, I used the clumped snp (include OR and effect allele) as input to compute score:
plink --bfile HCP --score gwas_0.00145P_clumped 2 4 6
I obtained the PRS of every subjects by plink.
PRSice calculations follow the manual strictly.
Then I checked the correlation between PRS from plink and PRSice.Pearson correlation coefficient is 0.49 ,p =0.
Hi,Sam. Thank you for your reply. I want to ask another question. Before we calculate the PRS by PRSice2, we need to confirm two data set have the same variants and effect allele.
Both in base and target data, the name of variants are the same, but the effect allele are different.
For example:
In target data: 1 rs181193408 83084 A T In base data: 1 rs181193408 83084 T A
Some snps have different effect allele as above, and some have the same effect allele.
Should I change the "A T" to "T A" of these different effect allele of snps and don't change the position of allele of snp which have the same effect allele in target data?
As long as the encoding is consistent, than PRSice should be able to handle them correctly.
Why the polygenic risk scores calculated by Plink and PRSice-2 are different
Hi Sam, I have asked a PRS related question. Could you give suggestions for it? Thanks!