Is There A Toolkit Which Provides Access To Pubmed, Pubchem, Chemspider, Kegg, And So On...?
4
0
Entering edit mode
13.2 years ago
Lo ▴ 10

I am on the lookout for a toolkit which provides convenient access to all those platforms? (...and is not Taverna - which is a great project).

data • 3.1k views
ADD COMMENT
2
Entering edit mode

"And so on"? You mean, every database? Please clarify.

ADD REPLY
1
Entering edit mode

related: "How can I unify genome data from different sources?" http://biostar.stackexchange.com/questions/3566/how-can-i-unify-genome-data-from-different-sources

ADD REPLY
1
Entering edit mode

@Pierre: No its not related. The questions comes with a chemical-premise, not a genomic one.

ADD REPLY
0
Entering edit mode

related: "Genomic data integration - where to find good information?" http://biostar.stackexchange.com/questions/9000/genomic-data-integration-where-to-find-good-information

ADD REPLY
0
Entering edit mode

'and so on': as much as possible, the best connection-toolkits out there

ADD REPLY
3
Entering edit mode
13.2 years ago

Well, we try to... Check [?]openphacts.org[?], but that is a long way to go. Although, yes, even in the prototypes we do have KEGG (which might have to go because of copyright), ChemSpider (they are a partner) and some others.

The whole idea is to combine this kind of data in large datastores that know about things that identify the same object/concept and define fixed relationships (predicates) to other objects using semantic web approaches which can be queried using RDF. All that is part of what is usually called the semantic we, ad pharma tries to use it for relationships that involve chemicals and this potential drugs.

BTW the Manchester (Taverna) group is involved in that as well.

ADD COMMENT
2
Entering edit mode
13.2 years ago
Neilfws 49k

You mention ChemSpider in your question. ChemSpider is itself derived from many sources, including the other 3 that you mention (Pubmed, Pubchem and KEGG). And it comes with a set of web services. So I suspect that ChemSpider itself is one of the best toolkits there is.

It's unusual to find ready-made "connection toolkits" for multiple databases of any kind. In general, you write them yourself around the available APIs.

ADD COMMENT
0
Entering edit mode

I am in the process of doing that. But I don't like to reinvent the wheel. Additionally,I would like to connect with people who are also working on cheminformatics connection toolkits.

ADD REPLY
1
Entering edit mode
13.2 years ago

The Cactvs toolkit www.xemistry.com/academic for a free academic/educational version) supports most of these.

  • Advanced access to PubChem (both structures and assays, with full support for handling the native ASN.1 data, with lookup of CIDs, SIDs, names, etc, plus access of the PubChem structure set as a virtual multi-record file with query operations automatically forwarded to Entrez/PubChem Structure search)

  • Structure resolution from many identifiers via NCI Lookup service, KEGG or PubChem

  • Determination of structure IDs and display URLs for PubChem, KEGG, ChemSpider, and about 2 dozen other Internet structure sources

  • Some support for preconfigured Entrez/PubMed queries (names, MESH terms, etc.). more can easily be scripted.

Please contact me directly if you need help to get started.

ADD COMMENT
0
Entering edit mode
13.1 years ago
Anon ▴ 10

Use Entrez eUtils... and PUG/SOAP...

ADD COMMENT

Login before adding your answer.

Traffic: 1972 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6