Hi all,
I was using annovar to remove common variants from my list of potentially interesting variatns using dbsnp132. However, I was finding that 99% of my variants were being found in dbsnp132, which is too many to remove from the analysis.
After reading some comments about dbsnp129 being more widely-used I switched to annotating using this database. I downloaded the hg19 version of dbsnp129 from the annovar website. However, according to annovar none of my variants are found in dbsnp129. Can this really be true?
Here is the annovar run with dbsnp132
[dunnin01@uk-cri-lcst01 annovar]$ ./annotate_variation.pl --buildver hg19 -filter -dbtype snp132 myvars.txt humandb/
NOTICE: Variants matching filtering criteria are written to myvars.txt.hg19_snp132_dropped, other variants are written to myvars.txt.hg19_snp132_filtered
NOTICE: Processing next batch with 36064 unique variants in 36079 input lines
NOTICE: Scanning filter database humandb/hg19_snp132.txt...Done
NOTICE: Variants with invalid input format were written to myvars.txt.invalid_input
[dunnin01@uk-cri-lcst01 annovar]$ wc -l myvars.txt.hg19_snp132_dropped
35445 myvars.txt.hg19_snp132_dropped
And with dbsnp129
[dunnin01@uk-cri-lcst01 annovar]$ ./annotate_variation.pl --buildver hg19 -filter -dbtype snp129 myvars.txt humandb/
NOTICE: Variants matching filtering criteria are written to myvars.txt.hg19_snp129_dropped, other variants are written to myvars.txt.hg19_snp129_filtered
NOTICE: Processing next batch with 36064 unique variants in 36079 input lines
NOTICE: Database index loaded. Total number of bins is 2699057 and number of bins to be scanned is 0
NOTICE: Scanning filter database humandb/hg19_snp129.txt...Done
NOTICE: Variants with invalid input format were written to myvars.txt.invalid_input
dunnin01@uk-cri-lcst01 annovar]$ wc -l myvars.txt.hg19_snp129_dropped
0 myvars.txt.hg19_snp129_dropped