Entering edit mode
6.9 years ago
mtscales
▴
20
Heya,
In the SIFTS flat file I downloaded (pdb_chain_uniprot.tsv.gz from https://www.ebi.ac.uk/pdbe/docs/sifts/quick.html) I am confused because for 153 of the (uniprot, pdb_chain) mappings SP_BEG > SP_END.
e.g.
PDB 1bxh
CHAIN A
SP_PRIMARY P02866
RES_BEG 1
RES_END 237
PDB_BEG 1
PDB_END 237
SP_BEG 164
SP_END 148
Is this an error or is it simply that the UniProt sequence and PDB sequence have the same sequence of 27 side chains but running in opposite directions w.r.t. the backbone.
Also, it seems a bit odd to me that the PDB seems to be for a protein with 237 AAs, but it only 'covers' 27 AAs on the mapped uniprot sequence.
Any ideas? I am quite confused.
Thank you!
Hi did you check the .pdb file of your PDB id? DBREF section of PDB file will tell you to extract mapping of your PDB chain with corresponding Uniport sequence.
Thanks Pallab, I had not looked in the .pdb file!
From .pdb file:
From my SIFTS mapping file:
So it would seem that the SIFTS file is _wrong_, and that the mapping should be like:
Would you say it is safer to trust the .pdb file than the SIFTS flat file? It is understandable that SIFTS made a mistake in this situation as it looks like an edge case, where [x][y] in a PDB structure maps to a uniprot that looks like ---[y]--[x]---
Thank you very much for your reply Pallab. I am a bit busy hence I am only just revisiting this. But thank you!