Hi everyone, I'm concern about the problem of predicting binary protein-protein interaction. Two proteins are defined to interact if they form a complex executing a function transiently and stably, e.g. a protein-ligand binds to a protein-receptor, homo-oligomers protein. But I notice that given two imaginary protein sequences folding in the imaginary space will result in two imaginary protein structures. I use the word 'imaginary' here for my thought experiment. Two structures would fluctuate dynamically and atomically, some atoms would attract others, some would repel. And expand to amino acid molecules in protein structures, some amino acids would attract other amino acids, some would repel. These two structures will have some complementary domains, some attracted domains, some repel domains. So can we from residue-residue contact (in this case is the distances in space between amino acids - contact map) to say this protein interacts with this protein or not (from my first definition). Thank you for any of your contributions and arguments.
I want to make sure that I got the idea right, sorry for my eager curiosity.
I think the residue-residue contact/distance profile in the interface of any two protein structure pair placed in the same place, that profile will reflect that whether two proteins interact or not (in this case interact here refer to the meaningful interaction that will lead to biological function). If two proteins interact, I think there is a subset of residues of this protein will be close enough to a subset of residues of that protein. In contrast, if two proteins do not interact, there is not any subset of residues of this protein will be close to subset of the other. In some cases, whether exist two proteins that do form any meaningful interact but their residues stay close together in long enough time? I suppose that there is not, they can not get in close long enough if they do not form any meaningful interaction.
I hope that I got a right understanding, if it is not, please let me know. Thank you!
Your understanding is correct. Although proteins bump into each other all the time, which means that subsets of their residues get in close contact with each other, those are not considered true interactions because they do not persist over a period of time.
As I said earlier, distance predictions (and thus contact-map predictions) are usually done intra-protein, meaning residues that are part of the same protein. This is based on co-evolution of interacting residues, where pairs of residues that that are folded in such a way to interact tend to mutate in correlated fashion. It turns out that residues that are part of different proteins that interact also co-evolve (mutate in correlated fashion), which is what they are exploiting in a paper you referenced.
Contact maps derived from intra-residue distances are used as distance restraints for protein folding (and other applications), which is a different application than inter-residue distances in protein interactions, but it is based on the same general principle. Inter-residue distances can be used to restrain the search space for docking protein partners, which not only speeds up the process but also makes it more accurate.