I have a non-Caucasian population with genotype data that I want to impute to identify rare variants. I can use the Genome Aggregation Database (gnomAD) reference but the populations used to generate this panel do not include my population of interest. However, I have 800 exomes sequenced from my population of interest. From other posts, I can see that merging reference panels might be a useful alternative, but I wonder about the implications of merging gnomAD with only exome data from my population of interest, as opposed to WGS data. How would this affect rare variant detection? Would it make sense to limit my rare variant detection to exomes only?
How can I merged gnomAD with my population of interest? I'm working with tumor-only samples and I'm doing a somatic variant calling with Mutect2. As a
--germline-resource
GATK recomend used af-only-gnomad.hg38.vcf.gz. However, I want to consider the AF of my population (ABraOM). Is it possible merged this two dataset? how can I do it?Please post as a new question and then come back and delete this.