How To Download Swiss-Prot .Dat File From Uniprot
11.1 years ago
kevinjspring ▴ 20

I have been trying to use Biopython to parse out certain domains from proteins, and it was suggested that I use the Bio.SwissProt module. Unfortunately, I don't see any Swiss-Prot data files available on UniProt. The only available file formats are GFF, FASTA, XML, and TXT. Does anyone know how I can get access to the Swiss-Prot file format?

biopython uniprot python • 13k views

TXT is what you want.


I used a slight workaround: I pulled the accession numbers from UniProt and then used the Biopython ExPASy module to pull the data.


I messed up my raw reads and now have only BAM files for my RNA-seq data. Is there any way to retrieve exactly the same raw data from the BAM files, given the same FASTA file?

11.1 years ago
Hamish ★ 3.3k

The "text" files (also known as 'dat' files) are the files in UniProtKB/Swiss-Prot format, so you can fetch these with:

wget 'ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_*.dat.gz'

or using one of the many mirrors:

wget 'ftp://ftp.ebi.ac.uk/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_*.dat.gz'

Note: the UniProtKB/TrEMBL file is large (approx. 20 GB compressed and about 110 GB uncompressed), so download it only if you really need it. See "Why is UniProtKB composed of 2 sections, UniProtKB/Swiss-Prot and UniProtKB/TrEMBL?" for an overview of the differences between UniProtKB/Swiss-Prot and UniProtKB/TrEMBL.

If you need the whole database, fetches like the above are recommended.
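Once a 'dat' file has been downloaded, it can be read one entry at a time without unpacking it. The following is only a minimal sketch using the Python standard library, not the Bio.SwissProt parser mentioned in the question. The helper names (`iter_entries`, `entry_name`, `domain_locations`, `scan_dat_gz`) are hypothetical, and the sketch assumes the flat-file layout in which each entry ends with a `//` line and a domain feature is written as `FT   DOMAIN   start..end`; files in an older layout would need different handling.

```python
import gzip


def iter_entries(lines):
    """Yield each entry as a list of lines; an entry ends with a '//' line."""
    entry = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("//"):
            if entry:
                yield entry
            entry = []
        else:
            entry.append(line)


def entry_name(entry):
    """Return the entry name from the ID line, or None if there is no ID line."""
    for line in entry:
        if line.startswith("ID "):
            return line.split()[1]
    return None


def domain_locations(entry):
    """Return the raw location strings of FT DOMAIN lines (assumed 'start..end')."""
    locations = []
    for line in entry:
        parts = line.split()
        if len(parts) >= 3 and parts[0] == "FT" and parts[1] == "DOMAIN":
            locations.append(parts[2])
    return locations


def scan_dat_gz(path):
    """Stream a gzipped .dat file and yield (entry name, domain locations)."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for entry in iter_entries(handle):
            yield entry_name(entry), domain_locations(entry)
```

For example, `for name, domains in scan_dat_gz("uniprot_sprot.dat.gz"): print(name, domains)` would walk through the file without loading it all into memory (the file name here is an assumption about what the wildcard above matches). For real work, a proper parser such as Bio.SwissProt is the safer choice.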

UniProt also provide subsets of the database based on:

Which may be more appropriate if you are only interested in certain organisms.

For specific entries, where you already have a list of identifiers or accessions, the various web services providing access to the UniProtKB data are more appropriate. For example:
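One way to fetch individual entries by accession is sketched below using only the Python standard library. This is an illustration, not the example originally linked here: the base URL is an assumption about the current UniProt REST service and should be checked against the UniProt help pages, and `entry_url` and `fetch_entry` are hypothetical helper names.

```python
from urllib.request import urlopen

# Assumed base URL of the UniProt REST service; verify before relying on it.
BASE_URL = "https://rest.uniprot.org/uniprotkb"


def entry_url(accession, fmt="txt"):
    """Build the URL for one entry; 'txt' requests the Swiss-Prot flat-file text."""
    return f"{BASE_URL}/{accession.strip()}.{fmt}"


def fetch_entry(accession):
    """Download one entry as text (requires network access)."""
    with urlopen(entry_url(accession)) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_entry` in a loop over a short list of accessions is fine for a handful of entries; for large lists, the bulk downloads above are likely to be kinder to the service.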

11.1 years ago
Prakki Rama ★ 2.7k

Hi, is this what you are looking for?


No, it seems like I need to use the ExPASy module to pull the records from the ExPASy database: http://biopython.org/DIST/docs/tutorial/Tutorial.html#htoc136
