input for gatk VariantAnnotator --dbsnp
1
0
Entering edit mode
7 weeks ago
QX ▴ 60

Hi all, I am run:

gatk VariantAnnotator \
       -R reference.fasta \
       -I input.bam \
       -V input.vcf \
       -o output.vcf \
       --dbsnp dbsnp.vcf

I would like to ask where can I find the dbsnp.vcf files. is that the 1000GENOMES-phase_3.vcf.gz? or the vcf from https://ftp.ncbi.nih.gov/snp/organisms/human_9606/VCF/

VariantAnnotator gatk • 435 views
ADD COMMENT
1
Entering edit mode
7 weeks ago
Shred ★ 1.5k

Literally the first result of a Google search

https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0/

ADD COMMENT
0
Entering edit mode

your link does not even give the description and how them were generated; how can you trust?

ADD REPLY
1
Entering edit mode

That link comes from GATK help article here: https://gatk.broadinstitute.org/hc/en-us/articles/360035890811-Resource-bundle

It is the resource bundle provided by Broad Institute.

ADD REPLY
0
Entering edit mode

thank you!

ADD REPLY
0
Entering edit mode

but do you know what is the 1000GENOMES-phase_3.vcf.gz and can it also be used?

ADD REPLY
0
Entering edit mode

1000 Genomes data is population level survey, you can read about it here: https://www.internationalgenome.org/category/phase-3/

The goal of the 1000 Genomes Project was to find common genetic variants with frequencies of at least 1% in the populations studied.

ADD REPLY

Login before adding your answer.

Traffic: 907 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6