Converting free-form molecule names into SMILES
1
1
Entering edit mode
9.5 years ago
mrbelt ▴ 10

I have a list of of names of molecules, some of them rather free-form (e.g "Thiolated L-alanine"). I would like to convert these names into SMILES strings. Is there any tool that could assist with dealing with these ambiguous names?

cheminformatics SMILES • 6.9k views
ADD COMMENT
3
Entering edit mode
9.4 years ago
cdsouthan ★ 1.9k

The Pub Chem Identifier Exchange Service will certainly assist you for the more standardised names that map to CIDs (that you can download as SMILES)

You can also try Chemicalize.org (if the SMILES download is working)

However, there is (by definition) no solution to ambiguous and/or non-standard names (free form as you put it). You will have to eyeball - or better still, find the real person who has invented the free form terms you are having to deal with and get them to substitute clean standard ones such as IUPAC names (e.g. Thiolated L-alanine is even google -ve)

ADD COMMENT
0
Entering edit mode

Nothing much I can add to this. BTW, this is exactly the reason why we need literature to list InChI or InChIKeys for all compounds, just like PDB identifiers of official gene names for proteins and DNA snippets.

ADD REPLY

Login before adding your answer.

Traffic: 3000 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6