Question

IMPUTE2: how to dynamically obtain chunks for each chromosome

0

Entering edit mode

4.4 years ago

m98 ▴ 420

I would like to run IMPUTE2 on some pre-phased data (made using SHAPEIT). The command I would like to use is like the example one given by IMPUTE2:

./impute2 \
-known_haps_g gwas_data_chr10_phased.haps \
-h pilot1.jun2010.b36.CEU.chr10.snpfilt.haps \
-l pilot1.jun2010.b36.CEU.chr10.snpfilt.legend \
-m genetic_map_chr10_combined_b36.txt \
-int 20000000 25000000 \
-Ne 15000 \
-buffer 250 \
-o gwas_data_chr10_imputed.20-25Mb.gen

I know it is best to split imputation in chromosome chunks (chunks of 5Mb here and set with the option -int). My question is how, given the file below which shows the chromosome and its size, is there a way I can dynamically obtain these chunks so I can write batch jobs?

1   249250621
2   243199373
3   198022430
4   191154276
5   180915260
6   171115067
etc

Many thanks.

IMPUTE2 chunks • 1.5k views

ADD COMMENT • link updated 4.4 years ago by jmukisa90 ▴ 30 • written 4.4 years ago by m98 ▴ 420

score 0 · Answer 1 · 2020-07-16

0

Entering edit mode

4.4 years ago

jmukisa90 ▴ 30

Hi M93, It seems you have a similar problem like the one I had here:ERROR: You must specify a valid interval for imputation using the -int argument, -use_prephased_g: command not found, in IMPUTE2. The solution provided by Kevin will be helpful.

ADD COMMENT • link 4.4 years ago by jmukisa90 ▴ 30