how to Automate the Retrieval of Variant-related Information from Pubmed, dbSNP, ClinVar, HGMD etc
3
2
Entering edit mode
6.7 years ago
gsr9999 ▴ 310

Dear Biostar leaders,

I have started to work on a new Variant Re-Classification project in my lab, and I am wondering how others implement this and it would be great if you could share/advice on best approaches being practiced. After sanger confirmation of WES variants, we classify variants based on ACMG guidelines and it is majorly carried out in 2 steps:

A. Manually search for variant specific information in Pubmed, Google search engine, variant databases(like ClinVar, dbSNP, COSMIC, LOVD, ExAc, 1000Genomes, HGMD etc.) and

B. Manually Review and Re-Classify Variants based on the newly assembled variant information.

I need to automate only the step A i.e. automate the process of collecting and integrating all latest variant associated information from different sources. Manual collection of variant information is laborious and we believe that by streamlining this step, we could improve our efficiency and it would be great to learn what the best practices are and how others implement this ?

I am wondering what the best possible approaches are like :

i) Download and localize all databases like dbSNP, ClinVar, COSMIC etc into a local server and then write python and SQL scripts to pull information specific to a mutation ? (or)

ii) Re-Use or write some web crawlers/data-mining to search new publications in Pubmed and in Google search engine for any variant/mutation specific information ? (or)

iii) Use Entrez APIs (E-utilities) to programatiaclly access online databases like ClinVar ? (or)

iv) etc ?

I am aware of some commercial companies that offer this service as a product like Mastermind(genomenon), VarSeq(goldenhelix), alamut-visual(interactive-biosoftware), etc. I guess companies like this have downloaded, integrated and curated all the data from literature and databases. These would be a good resource, but I am not sure if we could afford for a commercial software.

We are a small laboratory and I am a lone bioinformatician and trying to automate and integrate the variant associated information from dbSNP, Pubmed, ClinVar, etc. without a commercial software. I am aware that this would be a challenging project, but I am trying to achieve this in a sensible and best possible way.

Thanks, gsr

next-gen genome SNP • 2.7k views
ADD COMMENT
1
Entering edit mode
6.7 years ago

There won't be a single solution because each database is accessed differently. You have also already more or less given your own answer in your question.

I would download local copies of databases for ClinVar, dbSNP, and COSMIC and then access information from these via Python. However, version control then becomes important because these datasets are updated regularly.

HGMD requires a licence for the updated version; older versions of the database don't require a licence.

LOVD can also be downloaded, apparently on a gene-by-gene basis: http://www.lovd.nl/3.0/faq#FAQ14

For ExAc and 1000Genomes, you can also download these in VCF and then look up variant frequencies: http://www.internationalgenome.org/faq/how-can-i-get-allele-frequency-my-variant/

It's a lot of setup work but it would work when done. You could have the skeleton of the entire script finished in under 1 week.

Kevin

ADD COMMENT
1
Entering edit mode

Thanks for ur ideas Kevin

ADD REPLY
0
Entering edit mode
5.7 years ago
ND Woods • 0

There is a basic edition of Mastermind that is free and when you sign up you get a free 14 day trial of the pro version. We have 15x more variants than HGMD. You can even do a few searches with anonymous search which we added in the past 6 months or so. Just in case you want to check it out before having to fill out the form and actually sign up. I'll attach a link below, hope this helps, good luck!

https://mastermind.genomenon.com/

Also, we have variant interpretation cards on Amazon you might like. There's also a PDF version, I will attach both.

Amazon: https://www.amazon.com/dp/B07N45VK8W PDF Link: https://www.genomenon.com/vi-cards-printable/

ADD COMMENT
0
Entering edit mode
ADD COMMENT

Login before adding your answer.

Traffic: 1184 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6