First, why do this? The variant level QC practice of discarding markers that are far out of HWE is done in order to enrich for and eliminate markers have some kind of quality control issue.
Why "far" out of HWE? Well, certain genomic variants, such as inversion variants, are known to distort normal expectations for HWE (Sturtevant 1917). Removing an imputed variant that is slightly out of HWE could amount to discarding true positive data, for instance, if such a variant were involved.
The bottom line is that hard thresholding can discard useful data, but also can make your life easier if the vast majority of the thresholded variants are low quality. For that reason, what you are asking won't have a single answer that is correct in every case. Instead, you have to choose a level that makes sense to you depending upon your understanding of the practice of HWE thresholding and your data.
other approaches:
Many researchers take an approach that marks these variants, but stops short of removing them altogether. For example, you could leave them in the analysis but flag them, so if any of them turn out to be interesting, you have a reminder to go back and do exhaustive QC before publishing on them.
finally, there are many many GWAS QC papers published to date, see for instance turner et al or reed et al.
a final comment is that it is much more damaging to the study to have non-uniform practice BEFORE imputation than after. For instance, suppose you have more than one DNA microarray in your study. Prior to imputation one should make sure a set of variants common to both chips is selected as the imputation set. but since the question isnt about this ill stop here.
Hi,
I am in a similar situation. How did you solve that?
Thanks