Hi all,
I used CDD for extracting domains from my protein sequence,the results are little difficult to analyse i mean i have around 100-120 protein seq.of Rho Gtpases i need to extract the domains (seq with start and stop positions),well i tried to use web based tools to no avail, i just to get just the ph and dh domains from the sequence?Please suggest me some methods.
You could also use batch-search on Pfam. As output you will get text file which could be easily parsed using python. I was using it to extract domain architectures of proteins. It does contain name of domain, start, stop etc. Unfortunately it doesn't contain sequence, but BioPython have libraries to download sequence so if you have start and stop positions it shouldn't be a problem to retrieve domain sequence.
Thank you for your reply,well i am not really getting right results it does not exactly give me just the domains(i mean the exact sequence) but still i am trying to refine my search....
I really found pfam very useful for extracting domains.