Extracting Domains
3
1
Entering edit mode
13.2 years ago
Hari ▴ 280

Hi all, I used CDD for extracting domains from my protein sequence,the results are little difficult to analyse i mean i have around 100-120 protein seq.of Rho Gtpases i need to extract the domains (seq with start and stop positions),well i tried to use web based tools to no avail, i just to get just the ph and dh domains from the sequence?Please suggest me some methods.

Thanks in advance

protein • 6.8k views
ADD COMMENT
0
Entering edit mode

I really found pfam very useful for extracting domains.

ADD REPLY
3
Entering edit mode
13.2 years ago
Mateusz ▴ 70

You could also use batch-search on Pfam. As output you will get text file which could be easily parsed using python. I was using it to extract domain architectures of proteins. It does contain name of domain, start, stop etc. Unfortunately it doesn't contain sequence, but BioPython have libraries to download sequence so if you have start and stop positions it shouldn't be a problem to retrieve domain sequence.

Perhaps there is similar solution for CDD.

ADD COMMENT
1
Entering edit mode

CDD does support batch search online: http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi

ADD REPLY
0
Entering edit mode

thank you for ur reply,I am still waiting for the result and hopefully i can do some further analysis soon

ADD REPLY
1
Entering edit mode
13.2 years ago

Take a look at this:

http://www2.warwick.ac.uk/fac/sci/moac/students/peter_cock/python/rpsblast/

Ignore the BioPython part if you wish. rpsblast does give you the match boundaries as a result, even in tab-delimited format.

If you get stuck anywhere post a comment.

ADD COMMENT
0
Entering edit mode

Hi i am yet to try this method will let you know once i tray..thanks for the help

ADD REPLY
0
Entering edit mode

Hi I did not try this method may be i will do it later as I was able to do it with uniprot search and also CDD batch search was helpful thank you..

ADD REPLY
0
Entering edit mode
13.2 years ago

I would suggest looking at a biomart interface to your data source. In particular, you could try uniprot:

http://www.ebi.ac.uk/uniprot/biomart/martview/

ADD COMMENT
0
Entering edit mode

Thank you for your reply,well i am not really getting right results it does not exactly give me just the domains(i mean the exact sequence) but still i am trying to refine my search....

ADD REPLY
0
Entering edit mode

Biomart was also helpful but pfam results were much better ,i was able to directly download the domain sequence with start and end postions

ADD REPLY

Login before adding your answer.

Traffic: 2002 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6