Query UniProt for deleterious and neutral variations
1
0
Entering edit mode
9.3 years ago
Eli Korvigo ▴ 230

Hello everyone. In a recent article published by Dong et al. in Human Molecular Genetics ("Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies") the authors have managed to collect a big dataset of deleterious and neutral SNPs. The paper says "Training dataset is composed of 14191 deleterious mutations as true positive (TP) observations and 22001 neutral mutations as true negative (TN) observations, all based on the Uniprot database". Well, basically that's all they say about how the dataset has been collected. I would highly appreciate any recommendations on how one can query such information on UniProt, as I've found no direct ways of doing it. Thanks in advance.

SNP database • 1.8k views
ADD COMMENT
0
Entering edit mode
9.3 years ago

Uniprot is full of variants. You 'just' have to map the variation back to a reference genome.

Edit: and you can map the intersection of the known SNP with an uniprot track.

$ xmllint --xpath "//*[local-name()='feature' and @type='sequence variant']" "http://www.uniprot.org/uniprot/Q04721.xml"

<feature type="sequence variant" description="In ALGS2." id="VAR_029361" evidence="11">
<original>C</original>
<variation>Y</variation>
<location>
<position position="444"/>
</location>
</feature><feature type="sequence variant" description="In dbSNP:rs17024517." id="VAR_031463">
<original>V</original>
<variation>F</variation>
<location>
<position position="1667"/>
</location>
</feature>
ADD COMMENT
0
Entering edit mode

Thank you for your answer. The main point of the question is about sorting pathogenic variants, e.g. using the information from the "'Pathology and Biotech" field.

ADD REPLY

Login before adding your answer.

Traffic: 1611 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6