Question

How To Download Text Files From Swissprot?

2

Entering edit mode

11.8 years ago

philip34 ▴ 20

Hello,

I have a list of swissprot IDs. How can I download the complete data in text format from the swissprot based on these IDs? I need only the data of these IDs.

Your suggestions would be appreciated!!

protein • 4.6k views

ADD COMMENT • link updated 10.2 years ago by Biostar 20 • written 11.8 years ago by philip34 ▴ 20

score 8 · Answer 1 · 2013-01-23

8

Entering edit mode

11.8 years ago

Neilfws 49k

Assuming there is one ID per line in a file named "ids.txt", do this from a bash shell:

while read id; do wget http://www.uniprot.org/uniprot/$id.txt; done < ids.txt

ADD COMMENT • link 11.8 years ago by Neilfws 49k

score 5 · Answer 2 · 2013-01-23

5

Entering edit mode

11.8 years ago

aidan-budd 1.9k

For a non-programatic way of addressing the question...

If you go to the "Retrieve" tab at the top of the UniProt webpage http://www.uniprot.org/ , you can upload a file with a list of IDs in it, or just copy-and-paste into the query box. You can then get them in GFF, "Flat Text", FASTA, XML formats. This help page on the UniProt site gives more information about this http://www.uniprot.org/help/batch

ADD COMMENT • link 11.8 years ago by aidan-budd 1.9k

1

Entering edit mode

This method is much faster than using wget to download fasta files separately.

ADD REPLY • link 11.8 years ago by Pappu ★ 2.1k

0

Entering edit mode

It won't be "much faster" than wget, unless using the website magically increases the bandwidth of your network connection :)

ADD REPLY • link 11.8 years ago by Neilfws 49k

1

Entering edit mode

I was also using wget before to download thousands of sequences separately which was taking ages. Thanks to batch download, it is very fast now.

ADD REPLY • link 11.8 years ago by Pappu ★ 2.1k

0

Entering edit mode

Excellent solution; I had forgotten that UniProt offers batch retrieval.

ADD REPLY • link 11.8 years ago by Neilfws 49k

score 0 · Answer 3 · 2013-01-23

0

Entering edit mode

11.8 years ago

Naren ▴ 1000

If you want few files sequences do it manually via http://www.uniprot.org/uniprot/SWISSPROT_ID_YOU_WANT.txt
changing ID every time.
or If you are using windows "wget" is the best tool for downloads via command prompt.

ADD COMMENT • link 11.8 years ago by Naren ▴ 1000

score 0 · Answer 4 · 2013-01-23

0

Entering edit mode

11.8 years ago

samsara ▴ 630

This could help http://biopython.org/DIST/docs/tutorial/Tutorial.html#htoc132

ADD COMMENT • link 11.8 years ago by samsara ▴ 630