Hi I am working on PharmGKB data. Which is on dbSNP version 147 while my vcfs are on dbSNP 150 version.
I am searching PharmGKB rsids in my file. But I think due to different version of dbSNP some rsids are not found.
I want to convert PharmGKB rsids from 147 to 150. But their are no coordinates of genome available in PharmGKB. If I searched coordinates and then use these coordinates to update rsids then this will be long process. I searched and found dbSNP batch query on internet. But on its site it is announced that it will be retired in June 2018.
Can any one please tell me how should I update rsids without finding its coordinates?
Please give an example of the data you have. If you have a VCF file, you could simply download the latest VCF from dbSNP and use Tabix or the Ensemble Variant Effect Predictor to directly annotate each variant with the current dbSNP (151). I would strongly recommend to use dbSNP151 instead of 150 because there was a major update recently, including lots of variants from GRCh37. In my recent case, 151 flagged a lot of variants as common (based on 1000Genomes and TOPMED), which I previously considered rare, simply because they were not included in dbSNP150 with any AF from 1KG or TOPMED.
I do not have vcf. I have data in this format.