Hi, Biostars community.
According to VEP documentation, gnomAD genomes database could be used with --custom option. Example from VEP dosc:
./vep -i examples/homo_sapiens_GRCh38.vcf --cache \ --custom gnomad.genomes.r2.0.1.sites.GRCh38.noVEP.vcf.gz,gnomADg,vcf,exact,0,AF_AFR,AF_AMR,AF_ASJ,AF_EAS,AF_FIN,AF_NFE,AF_OTH
But there is no gnomAD genomes file for all chromosomes on ensembl's ftp source: ftp://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh38/variation_genotype/gnomad/r2.1/genomes/
Only data separated by chromosome is presented.
Can someone explain me, how to merge given .vcf.gz files correctly?
Hi....Have you found a source to download all sites for gnomaAD allele frequency data?? I am combining all chromosomes but it will take at least a week for my job to finish on our HPC thanks
Hi! Yes, AF included in data privided on gnomAD downloads page. BTW, if you interested in all chromosomes AF data, look at "All chromosomes sites VCF". https://gnomad.broadinstitute.org/downloads
Yes, but that's gnomAD 2, which is an inferior lift-over to hg38, not gnomAD 3, which is read alignment to hg38. gnomAD 3 doesn't offer a single VCF file which would be useful for VEP.
this is available since 2 years ago https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh38/variation_genotype/gnomad/r3.0
Version 3 is the genomes