Hi, guys, I have downloaded HPRD database Release 9 which have 39240 interactions and i want to delete the self-interactions and duplicated interactions programmaticaly, but i don't know how the duplicated interactions looks like in this data .
I mean self-interactions like this:FES FES, but what about duplicated interactions looks like ? Plz I appreciate any help !
@Jean-Karim Heriche Friend is HPRD data release 9 has such form of duplicated interactions? like {'A' 'B' } and {'B' 'A'}?
I don't remember and I don't have the data anymore. Anyway, my scripts were always set up to remove duplicates unless I cared about the distinction (e.g. different types of experiments). You don't say what you're trying to do but if you want a more comprehensive set of human protein interactions, I would suggest to use iRefIndex.
I have the HPRD data release9 as a textfile and I want to remove the duplicated interactions from it.
I understood that you have HPRD data and want to remove duplicates. I already answered this: just write your data processing script in such a way that if there are duplicates, it deals with them in the way you want. If you just want to know whether or not there are duplicates, just write a simple script to find out. By "what you're trying to do", I was referring to what biological question you're trying to answer and wondering whether HPRD is the best data set for this.
I want to build a network by linking the different list of genes I found based on any human data (should be human data) is iRefIndex.can do this job? is it human data?
IRefIndex is a compilation of several protein-protein interaction databases and so includes human data. Read the paper to understand how it's done. To get human data only, just filter on the taxon ID in the relevant columns. So if you need to look for interactions involving genes in your lists then you're better off using iRefIndex (or any other compilation of multiple data sources) than just a single (outdated) data source.
To access the iRefIndex data, you can also use the iRefR package for R and there's a plug-in for Cytoscape 2.8. Finally there's also a web interface at iRefWeb.