Hi All,
Who can share a perl/python script to transfer phased vcf to phased tped?
Thanks.
Update: plink
will re-order the alleles therefore 'phase' status will be broken if plink
was used in the data processing. Thanks for the explanation to it: the order in which the alleles appear in heterozygous genotype calls is usually determined by which allele is major/minor in the immediate dataset; this ordering will not vary between samples
Not answering your question but you have perhaps seen this.
This is the correct 'answer'. If you care about representing genotype phase in text, use VCF.
Yes. I think there should be some wheels outside.
Wheels? That word does not make sense in this context. Could you explain using different words, maybe?
Quote from PLINK docs:
PED files do not hold genotype, phased or not, information. Are you sure you're asking the right question?
PED files do hold genotypes, see https://www.cog-genomics.org/plink2/formats#ped
There are two prevalent PED formats - the one used/generated by plink has genotype information after the first six columns. The subset of this file with the first six columns alone is used in other tools, such as GATK's PhaseByTransmission, etc, and is the more prevalent one for clinical genetics usage. PLINK calls this format the
.fam
file.Oops! Got confused with the .fam files. Thanks for the info!