Hello,
I have a plink 2.0 file ("Ref" is always the major allele) and want to reorder its ref and alt alleles by a fasta reference. For this I use the --ref-from-fa command. However, the following error occurs:
Error: --ref-from-fa wants to change reference allele assignment at X:2700157,but it's marked `as 'known'. Add the 'force' modifier to force this change`
What does marked as "known" mean? How should I go on?
Best,
Andreas
Hmmm. My data does not come from a "regular vcf" file, but from a standard GWAS-microarray workflow with PLINK. How would you proceed? --> "Force" it?
Yes, if you know the supposed REF alleles are really just major alleles, that's what "force" is for.
If I do so, I get the following report:
In total I only have 4686 variants...
The fasta reference which I use is from here: ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/human_g1k_v37.fasta.gz
And here is my command:
What could be wrong here?
Thanks for your help!
Perhaps your original dataset had "backwards" alleles (with REF usually A1 instead of A2) for some reason, while still being correctly encoded. You can sanity-check this by running --freq on your updated dataset: if the alternate allele frequencies are usually low, you're probably fine.