Plink error: Variant names are limited to 16000 characters.
0
1
Entering edit mode
3.5 years ago
biyao.wang1 ▴ 10

Hi plink experts,

I'm trying to convert *.dose.vcf(.gz) files (product of the Michigan Imputation Server) to plink binary ones and got error since some snps has extremely long IDs. Does anyone know how to tackle this?

Commands I've tried in plink1.9 and 2:

--set-missing-var-ids @:#[b37] 
--set-all-var-ids  @:#[b37] 
--new-id-max-allele-len 
--vcf *.dose.vcf dosage=HDS 
--snps-only 
--biallelic-only 

I also tried DosageConvertor which has a function --trimNames but it resulted in dosage.gz, map, fam format files, which I don't know how to use...

I found very limited info online about this (see https://groups.google.com/g/plink2-users/c/u6nvaI1Olyg), it suggested to trim ID before converting in plink but did not exactly say how to do it.

Any suggestion will be appreciated! Many thanks in advance!

plink imputation • 1.3k views
ADD COMMENT
0
Entering edit mode

As it is a VCF file, maybe update the variant names before using plink, see below post:

ADD REPLY

Login before adding your answer.

Traffic: 2911 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6