I am working on human protein protein interaction I need to run SVM I need a negative data set comprising of non interacting protein pairs how to generate these. I found this database Negatome (http://mips.helmholtz-muenchen.de/proj/ppi/negatome/) but the the problem is it talking about most mammals I cant restrict my search to human. Is there any other database containing human non interacting protein pairs or is there any other method to do this. Thanks in advance.
As far as I remember, the Negatome database uses UniProt accession numbers so you could use UniProt's REST API to figure out which species each protein belongs to. If you want to combine data sets from multiple sources, I would suggest to map the different IDs to the same human reference genome annotation e.g. EnsEMBL.
Use the "Add comment" button when replying to an answer. This keeps the discussion organized and doesn't mislead users by creating answers that are not real answers to the question.