download a set of proteome from uniprot
1
0
Entering edit mode
21 months ago
Juke34 8.9k

I have a bench of Uniprot proteome ID I would like to download in an automated way. e.g.

UP000005640
UP000001073
UP000001519

It sounds an easy task but after many tries I'm still unsuccessful, anyone has trick?

  • I tried using a for loop around wget command but I need a wild card or regex but I don't succeed to make it work: wget --regex-type .fasta.gz https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/reference_proteomes/Eukaryota/UP000000226/UP000000226_*.fasta.gz

  • I tried via the website but I do not get the fasta sequences, only some general description

uniprot • 1.6k views
ADD COMMENT
1
Entering edit mode
21 months ago
GenoMax 147k

https://www.uniprot.org/uniprotkb?query=proteome:UP000001519 click on download (make sure you choose all and FASTA).

Replace proteome accession as needed.

You can also get the API URL from this page.

URL

ADD COMMENT
0
Entering edit mode

Great, so my problem was to use https://www.uniprot.org/proteomes instead of https://www.uniprot.org/uniprotkb that do not deliver the same data. Sounds weird to me to not be able to download fasta proteome from the proteomes side of uniprot!

I have finally used API URL with the snippet they provide: https://www.uniprot.org/help/api_queries (2.2 Large number of results: use pagination).

ADD REPLY
1
Entering edit mode

You can download the members of a proteome from its proteome page: In https://www.uniprot.org/proteomes/UP000005640 , you can click on the number after "Protein count" and download from there, or just at the top of the component table, click directly on "Download".

For programmatic access, there are some example scripts in https://www.uniprot.org/help/api_downloading , e.g. "Download the UniProt reference proteomes for all organisms below a given taxonomy node in compressed FASTA format"

ADD REPLY

Login before adding your answer.

Traffic: 1918 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6