Question

download a set of proteome from uniprot

0

Entering edit mode

2.2 years ago

Juke34 9.2k

I have a bench of Uniprot proteome ID I would like to download in an automated way. e.g.

UP000005640
UP000001073
UP000001519

It sounds an easy task but after many tries I'm still unsuccessful, anyone has trick?

I tried using the perl approach they describe here without success: https://www.uniprot.org/help/api_downloading

I tried using a for loop around wget command but I need a wild card or regex but I don't succeed to make it work: wget --regex-type .fasta.gz https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/reference_proteomes/Eukaryota/UP000000226/UP000000226_*.fasta.gz
I tried via the website but I do not get the fasta sequences, only some general description

uniprot • 1.9k views

ADD COMMENT • link updated 2.0 years ago by Ram 45k • written 2.2 years ago by Juke34 9.2k

score 1 · Answer 1 · 2023-02-07

1

Entering edit mode

2.2 years ago

GenoMax 150k

https://www.uniprot.org/uniprotkb?query=proteome:UP000001519 click on download (make sure you choose all and FASTA).

Replace proteome accession as needed.

You can also get the API URL from this page.

URL

ADD COMMENT • link 2.2 years ago by GenoMax 150k

0

Entering edit mode

Great, so my problem was to use https://www.uniprot.org/proteomes instead of https://www.uniprot.org/uniprotkb that do not deliver the same data. Sounds weird to me to not be able to download fasta proteome from the proteomes side of uniprot!

I have finally used API URL with the snippet they provide: https://www.uniprot.org/help/api_queries (2.2 Large number of results: use pagination).

ADD REPLY • link 2.2 years ago by Juke34 9.2k

1

Entering edit mode

You can download the members of a proteome from its proteome page: In https://www.uniprot.org/proteomes/UP000005640 , you can click on the number after "Protein count" and download from there, or just at the top of the component table, click directly on "Download".

For programmatic access, there are some example scripts in https://www.uniprot.org/help/api_downloading , e.g. "Download the UniProt reference proteomes for all organisms below a given taxonomy node in compressed FASTA format"

ADD REPLY • link 2.2 years ago by Elisabeth Gasteiger ★ 2.4k