Question

Extracting Domains

1

Entering edit mode

13.2 years ago

Hari ▴ 280

Hi all, I used CDD for extracting domains from my protein sequence,the results are little difficult to analyse i mean i have around 100-120 protein seq.of Rho Gtpases i need to extract the domains (seq with start and stop positions),well i tried to use web based tools to no avail, i just to get just the ph and dh domains from the sequence?Please suggest me some methods.

Thanks in advance

protein • 6.8k views

ADD COMMENT • link updated 13.2 years ago by Michael Schubert ★ 7.1k • written 13.2 years ago by Hari ▴ 280

0

Entering edit mode

I really found pfam very useful for extracting domains.

ADD REPLY • link 13.1 years ago by Hari ▴ 280

score 3 · Answer 1 · 2011-09-16

3

Entering edit mode

13.2 years ago

Mateusz ▴ 70

You could also use batch-search on Pfam. As output you will get text file which could be easily parsed using python. I was using it to extract domain architectures of proteins. It does contain name of domain, start, stop etc. Unfortunately it doesn't contain sequence, but BioPython have libraries to download sequence so if you have start and stop positions it shouldn't be a problem to retrieve domain sequence.

Perhaps there is similar solution for CDD.

ADD COMMENT • link 13.2 years ago by Mateusz ▴ 70

1

Entering edit mode

CDD does support batch search online: http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi

ADD REPLY • link 13.2 years ago by Michael Schubert ★ 7.1k

0

Entering edit mode

thank you for ur reply,I am still waiting for the result and hopefully i can do some further analysis soon

ADD REPLY • link 13.2 years ago by Hari ▴ 280

score 1 · Answer 2 · 2011-09-16

1

Entering edit mode

13.2 years ago

Michael Schubert ★ 7.1k

Take a look at this:

http://www2.warwick.ac.uk/fac/sci/moac/students/peter_cock/python/rpsblast/

Ignore the BioPython part if you wish. rpsblast does give you the match boundaries as a result, even in tab-delimited format.

If you get stuck anywhere post a comment.

ADD COMMENT • link 13.2 years ago by Michael Schubert ★ 7.1k

0

Entering edit mode

Hi i am yet to try this method will let you know once i tray..thanks for the help

ADD REPLY • link 13.2 years ago by Hari ▴ 280

0

Entering edit mode

Hi I did not try this method may be i will do it later as I was able to do it with uniprot search and also CDD batch search was helpful thank you..

ADD REPLY • link 13.2 years ago by Hari ▴ 280

score 0 · Answer 3 · 2011-09-16

0

Entering edit mode

13.2 years ago

Sean Davis 27k

I would suggest looking at a biomart interface to your data source. In particular, you could try uniprot:

http://www.ebi.ac.uk/uniprot/biomart/martview/

ADD COMMENT • link 13.2 years ago by Sean Davis 27k

0

Entering edit mode

Thank you for your reply,well i am not really getting right results it does not exactly give me just the domains(i mean the exact sequence) but still i am trying to refine my search....

ADD REPLY • link 13.2 years ago by Hari ▴ 280

0

Entering edit mode

Biomart was also helpful but pfam results were much better ,i was able to directly download the domain sequence with start and end postions

ADD REPLY • link 13.1 years ago by Hari ▴ 280