Legend and hap files for imputation with 38 build
0
0
Entering edit mode
3.0 years ago
anna ▴ 20

Hi, I want to impute my data genotyped with 38 build. Where can I find legend and hap files for 38 build?

I am trying to use vcf2impute_legend_haps script provided by IMPUTE2, but it does not give me any results with vcf files downloaded from here http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000G_2504_high_coverage/working/20201028_3202_phased/

Thank you!

reference 38build impute imputation • 2.4k views
ADD COMMENT
1
Entering edit mode

Really you should be using the most recent version, impute5, as imputed2 is very old now. If you use impute5, it will be much faster and you can directly use vcf files rather than having to convert them into some exotic format.

ADD REPLY
0
Entering edit mode

I checked IMPUTE5, but not sure that there is an option for that..

ADD REPLY
1
Entering edit mode

There is an option to use a reference panel? Why do you think there isn't?

ADD REPLY
0
Entering edit mode

I could not find -merge_ref_panels flag as in IMPUTE2 manual here https://mathgen.stats.ox.ac.uk/impute/impute_v2.html#-merge_ref_panels

ADD REPLY
1
Entering edit mode

OK - you could merge them together using bcftools --merge if they are in vcf format and then use the output in IMPUTE5 or another more recent software.

ADD REPLY
0
Entering edit mode

Thank you very much for your help! Would you recommend phasing with SHAPEIT2? As I understand, there is a possibility to phase with and without using the reference panel, though with the last option I need again legend/hap formats.

ADD REPLY
1
Entering edit mode

If you want to pre-phase then use shapeit4 as it is the most recent version and will save you a lot of time since i) you can use vcf/bcf and directly and then use them with IMPUTE5 and ii) it's a lot faster. What size is the dataset you are phasing? If it's less than say, 2000 individuals then I would probably recommend using a reference panel, definitely if it's less than 100 samples. both shapeit4 and impute5 allow you to use reference panels.

ADD REPLY
0
Entering edit mode

My samples size is 35 samples. Thank you very much!

ADD REPLY
0
Entering edit mode

Hi, Sorry do you know any method (software, script), which I can use to compare the imputed data in vcf format? I would like to get Rsqv for position? Thank you very much!

ADD REPLY

Login before adding your answer.

Traffic: 1487 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6