Hi there, I have a set of transcripts from tomato for which I'd like to get their predicted protein function by looking for conserved domains within these transcripts, preferably from NCBI CDD database (or any other alternative you know of). Ideally it would be great if I could get the type of domain, the location this domain extends on that transcript and its score (E-value). Do you think there is a way I could code this, in say perl, to communicate with the CDD database and automate for all transcripts I have?
Thank you very much!
are you looking to automatize the steps that one would do via the CD Search Service?
Istvan, yes, that's basically it.