Retrieving SMILES using CAS or SAMPLE ID or SAMPLE_NAME
1
0
Entering edit mode
7.3 years ago

Hi,

I am trying to generate SMILES for a set of molecules. As shown below, I have the sample _ID, name, Pubchem_SID (which looks funny) and CAS. I tried using Pubchempy to extract SMILES using Pubchem_SID but it failed. Given the fields below, I'd appreciate some direction on how I can generate SMILES for these molecules. Thanks!

**SAMPLE_ID**
NCGC00013369-03
NCGC00013556-01
NCGC00016522-05
NCGC00016544-01

**SAMPLE_NAME**
Pamoic acid
3,5,3'-Triiodothyronine
Bisacodyl
Dehydroepiandrosterone acetate

**PUBCHEM_SID**
144206996
144203580
144204023
144204036

**CAS**
130-85-8
6893-02-03
603-50-9
853-23-6
cheminformatics PubChem SMILES • 5.0k views
ADD COMMENT
2
Entering edit mode
7.3 years ago
prasundutta87 ▴ 670

Many ways to achieve what you want-

1) Pubchem_SID is Pubchem Substance ID..so if you go to the Pubchem homepage, select 'Substance' instead of 'Compound' in the Search bar and search for, for example, '144206996', you will get Pamoic acid. You can download the 2D SDF and then convert it to SMILES for any of your downstream analysis using OpenBabel or any format converter that you possess..

Furthermore, if you want to download all the SIDs at once, you can write a small perl script based on this example command:

wget -O 144206996.sdf https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sid/144206996/record/SDF

You can write a loop to change the SIDs and all your SDFs will be downloaded at once that you can convert into SMILES...

2) Go to https://pubchem.ncbi.nlm.nih.gov/pc_fetch/pc_fetch.cgi, choose 'Pubchem Substance' as the database, give a list of SIDs in 'Enter IDs' or upload the list, select SMILES in 'choose a format', and click 'Download'..you will get what you want..

3) If you want to continue using Pubchempy, check this out: https://pubchempy.readthedocs.io/en/latest/guide/substance.html#retrieving-substances

ADD COMMENT
0
Entering edit mode

Thank you so so much! I think I'd go with the Pubchempy

ADD REPLY

Login before adding your answer.

Traffic: 2767 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6