Fastphase To Plink Format
11.7 years ago
kumar.vinod81 ▴ 340

I've imputed my plink.ped file using fastPHASE, and the imputation completed successfully. But now I want to convert the fastPHASE output back into PLINK format. Any ideas?

fastPHASE format (2 individuals and 11 SNPs; the SNPs are laid out row-wise):

ID 5906291001R01C01 1 1 1 0 1 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 0 1 
ID 5906291001R01C02 1 1 1 0 1 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1

Can you give us sample output?


fastPHASE format (2 individuals and 11 SNPs; the SNPs are laid out row-wise):

ID 5906291001R01C01 1 1 1 0 1 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 0 1
ID 5906291001R01C02 1 1 1 0 1 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1


It does not look the way I intended when pasted here. The SNP sites are row-wise.

11.7 years ago
zx8754 12k

The output looks like phased data, not imputed data. I suggest using IMPUTE2:

  1. Convert the PLINK files to GEN format using GTOOL

  2. Use IMPUTE2 for imputation

  3. Then use PLINK to work with the dosage output


All the sites are imputed, but I am not able to convert them into PLINK format... I have completely failed.


I've not used impute2. Is it easy to use?

11.7 years ago
Wen.Huang ★ 1.2k
  1. Convert this format to PLINK's tped format, which is very similar to the fastPHASE output, with just a few extra columns.
  2. Make a tfam file with the individual IDs, etc.
  3. With the tped and tfam files in hand, you can use PLINK to make whatever conversions you need.
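
A minimal Python sketch of steps 1 and 2, assuming each individual's two phased haplotypes have already been read into lists and that SNP coordinates are taken from the original PLINK .map file. The function name `build_tped_tfam` and the example SNP names are hypothetical, and the alleles are assumed to be recoded to 1/2 beforehand, since PLINK reads 0 as a missing allele by default:

```python
def build_tped_tfam(haplotypes, snp_info):
    """Return (tped_lines, tfam_lines) as lists of strings.

    haplotypes: list of (sample_id, hap1, hap2), where hap1 and hap2 are
                lists of allele strings, one entry per SNP.
    snp_info:   list of (chrom, snp_id, genetic_distance, bp_position),
                one entry per SNP, in the same order as the haplotypes.
    """
    tped = []
    for j, (chrom, snp_id, cm, bp) in enumerate(snp_info):
        # A tped row starts with the four map columns for this SNP ...
        row = [str(chrom), str(snp_id), str(cm), str(bp)]
        # ... followed by two alleles per individual.
        for _, hap1, hap2 in haplotypes:
            row += [hap1[j], hap2[j]]
        tped.append(" ".join(row))

    # tfam columns: family ID, individual ID, father, mother, sex, phenotype.
    # Everything except the IDs is a placeholder (unknown / missing).
    tfam = [" ".join([sid, sid, "0", "0", "0", "-9"]) for sid, _, _ in haplotypes]
    return tped, tfam


# Hypothetical two-sample, two-SNP example.
haps = [("S1", ["2", "1"], ["1", "1"]),
        ("S2", ["2", "2"], ["2", "1"])]
snps = [(1, "snp1", 0, 100), (1, "snp2", 0, 200)]
tped_lines, tfam_lines = build_tped_tfam(haps, snps)
print("\n".join(tped_lines))
print("\n".join(tfam_lines))
```

Once the two lists are written to `<prefix>.tped` and `<prefix>.tfam`, step 3 can load them with PLINK's `--tfile <prefix>` option and write other formats with `--recode`.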

Thanks Huang, but the SNP sites in fastPHASE format are in rows, not in successive columns, whereas in tped format the SNP sites are in the same row and different columns. What should I do? I am totally puzzled...


Hi Kumar, did you ever find a solution to this? I am encountering the same problem....


If you transpose your file, then it is similar to this question: Convert genotype matrix into plink format


I'm afraid the conversion is not that simple. The genotype for each individual is in rows. To use the example above:

The output of fastPHASE (2 individuals, 11 SNP loci)

ID 5906291001R01C01 1 1 1 0 1 1 1 1 1 0 1 1 1 1 0 1 1 1 1 1 0 1 
ID 5906291001R01C02 1 1 1 0 1 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1

In short the format goes something like this for 3 SNP loci per individual:

SNP1_Allele_1 SNP2_Allele_1 SNP3_Allele_1 SNP1_Allele_2 SNP2_Allele_2 SNP3_Allele_2

See gtool -G option for GEN to PED Conversion mode.


I didn't get any reply on Biostar, but one of my bioinformatician friends wrote a Perl script, and now I always send my data to him and he sends it back after formatting. It is really not simple with the tools available online. Try contacting your bioinformatics team. Thanks
