I've a list of ~50 genes and I want to know their common downstream or upstream target genes (within all known pathways). My idea was to download for every gene, the list of genes that interact with it, and intersect the results. But I don't know where to start (KEGG, REACTOME,...) I know it's possible to do it with IPA but I'm looking for a free solution.
You can accomplish this via the Pathway Commons 2 web service. The query type of interest to you is the commonstream in either upstream or downstream directions. For example, the following gives you the common downstream of EGFR and ERBB2:
You can either script this via your programming language of choice, or Paxtools library (Java) or PaxtoolsR. If you rather work with a graphical user interface, then I suggest you either use ChiBE 2 or CyPath2 -- both of them will allow you to run these queries easily.
Thank you that's great. Just a quick question about pathway commons 2. How can I specify to have the gene symbol in place of uniprot id url (for example: http://identifiers.org/uniprot/Q99075 gives me HBEGF).
Right now it is not possible to request another identifier through the web service, but the last example which produces EXTENDED_BINARY_SIF, will help you get this. In that file, in addition to the standard identifiers, you also have access to other identifiers associated with the node (like HGNC, Entrez Gene ID and etc.) It requires a little bit more parsing, but has much more information compared to simple BINARY_SIF.
KEGG and REACTOME are indeed good places to begin to identify the genes encoding proteins that reside in pathways just upstream or downstream of your set of 50 genes, with regard to information flow. The word "target" could imply transcription factor (TF) binding, or your gene encodes a TF. If this is what your genes and their analysis entails, you'd need a different approach, but I'm not certain this is your intention.
Thank you that's great. Just a quick question about pathway commons 2. How can I specify to have the gene symbol in place of uniprot id url (for example: http://identifiers.org/uniprot/Q99075 gives me HBEGF).
Thanks
Right now it is not possible to request another identifier through the web service, but the last example which produces
EXTENDED_BINARY_SIF
, will help you get this. In that file, in addition to the standard identifiers, you also have access to other identifiers associated with the node (like HGNC, Entrez Gene ID and etc.) It requires a little bit more parsing, but has much more information compared to simpleBINARY_SIF
.