After getting help from EBI support I got a db of ligand binding sites from PDBeMotif database working, quite up-to-date and local.
I am using it for some time already, and it seems to be cool. It basically lists all bonds to hetatms from PDB, together with distance info, and with annotations on interaction type.
I wrote some quick-and-dirty Python code to download and use this db.
Installation
pip install http://modorama.biocomputing.it/beta_static/src/pdbemotif-0.1.0.tar.gz
It should install pdbemotif
Python module, and two scripts should go to your python bin:
download_selected_lig_sites.py
download_all_lig_sites.py
They download data on bonds with protein and hetatms. You can use like this:
python -u path_to_bin/download_all_lig_sites.py /data1/outdir/
python -u path_to_bin/download_selected_lig_sites.py some_pdbcodes.txt /data1/tmp/outdir/
where some some_pdbcodes.txt
file lists pdbcodes, one per each line.
Downloading all "sites" takes 3 days. You will get a list of xml files, one per each pdbcode.
Then you can do:
from pdbemotif import BoundMoleculeSite
site_file = open('/data/db/PDBeMotif/3mht.txt')
mol_site = BoundMoleculeSite(site_file.read())
site_file.close()
print mol_site.get_all_binding_resi_for_chain('A')
Can't you perform an 'advanced' query and batch download the results?
Is this not in PDBs FTP site? http://www.pdb.org/pdb/static.do?p=download/ftp/index.html
I think ligand binding site information is not there. I think the binding site data from PDBe is in the PDBe Motif, but I am not sure yet how to download and handle that.
Nir, I think I should be able to that using the PDBeMotif web interface, but I simply have no idea how. I queried EBI HELP for that, but I am still waiting for a response.
Got the response from EBI HELP. Local installation of PDBeMotif is too difficult and too hardware demanding. And the newest software won't be available. I got some hints about XML queries of PDBeMotif online, but I am not sure yet if it is possible to get all data I need.