I am using an Illumina omni 2.5 genotyping array and plan on using impute2 to perform imputation.
The documentation of impute2 recommends to process whole chromosomes in chunks of ~5MB. However, I did not find any information regarding the minimal number of SNPs that should be present per chunk.
It feels "wrong" to impute tens of thousands of genotypes from the reference if there are only ~100 SNPs in the chunk that I am analyzing.
Is my gut feeling just wrong here or can you tell me any recommendations on how to deal with this?
Cheers!
Tip - there are many faster/more memory imputation algorithms than IMPUTE2 - check out beagle5 or IMPUTE5 - they will be much easier to use.
Thanks! I will do that