Find all domains in a list of proteins
1
0
Entering edit mode
7.1 years ago
dugong • 0

I would like to get a list of all domains of all proteins in my list.

Browsing STRING, I noticed from the the domain structures (displays images from SMART database) that there are a lot of GEF-domians, but I cannot batch extract or batch search. I found no batch search on SMART or PFAM.

I could not find an easy tool. To find this in uniprot, or pfam, it seems that I need to search one by one.

Thanks for suggestions!

An example list:

Fopnl
AI597479
Ccnb1ip1
Arhgap17
Prmt7
Fbxo11
Plekhg2
Ric1
Trub1
Rfc4
Ric8a
n.R5.8s1
Fbxo6
Med23
Nol4l
Rbsn
Dis3
Ddx23
Mtrf1l

I guess downloading PFAM could work. Question is then how to extract a "ID -> domain annotations" from the full database.

protein domain • 3.1k views
ADD COMMENT
1
Entering edit mode

NCBI CDD is batch query tool for domain searching provided you have the sequences for these identifiers.

ADD REPLY
0
Entering edit mode

show us a few identifiers from your list please.

ADD REPLY
5
Entering edit mode
7.1 years ago
Emily 24k

You could use BioMart. Help video to get you started. Filter by your list of identifiers, get PFAM domains as attributes.

ADD COMMENT

Login before adding your answer.

Traffic: 1673 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6