I've got a list of codding genes and I would like to get the cellular location of the expressed proteins, all at once. I already looked up on uniprot but wasn't able to solve it.
I've got a list of codding genes and I would like to get the cellular location of the expressed proteins, all at once. I already looked up on uniprot but wasn't able to solve it.
You could use data from the human protein atlas or neXtProt for this.
http://compartments.jensenlab.org allows you to look up each gene individually, or download a flatfile with the location of each.
If you want to use UniProt, you can use the "Retrieve/ID mapping" service to upload your list of identifiers (or gene names): http://www.uniprot.org/uploadlists
Select the appropriate value under "From:" (e.g. HGNC, Gene names, RefSeq), and choose to map to UniProtKB.
Once you have your result, you can use the "Columns" button to customize your output, and in particular to add a column for subcellular location (and optionally remove other columns). These customized results can then be downloaded in tab-delimited format. See also http://www.uniprot.org/help/customize
Please don't hesitate to contact the UniProt helpdesk if you have additional questions.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Hi, thank you for your answer.
Can I upload my data over there or should I search gene by gene?
I am not sure about Protein Atlas. We actually downloaded some of the data they use for our own TissueAnalyzer on WIkiPathways. So you definitely download the data and use it locally and do your own queries, but they might support online queries too.
NeXtProt definitely offers a SPARQL endpoint (that was actually presented at a recent ISMB keynote) and might have other services to query it with the data you want to use. You might still prefer to download the data though.