I am trying to determine the structure of one domain of a multi-domain protein. This is what I have done so far:
1. Extracted the FASTA sequence of the domain from UniProt.
2. Ran a PSI-BLAST search with it.
3. Selected around 13 species from the BLAST results.
4. Meanwhile, searched for the domain in Pfam, selected around 18 sequences from there, and performed a multiple sequence alignment (MSA) with ClustalW.
I have two questions for now:
1. Have I proceeded correctly so far? If so, what should I do next?
2. The PSI-BLAST results show the sequence of the entire protein for each hit, even though my query was the domain only. Why does this happen, and how should I trim the hit sequences correctly, i.e., so that I retrieve only the domain of interest?
A fairly long question, but I hope it makes sense.
Hoping for a reply soon. Thanks in advance.
Did you extract the sequence of the domain of your interest or the multi-domain protein from UniProt?
Does this mean that you extracted the sequence for the domain of interest, or for the whole multi-domain protein, from UniProt? And do you want to model the structure of the domain?
In all my searches I have used the sequence of the domain of interest only, not the entire protein. Yes, I want to model the structure of the domain only.
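On the trimming question: BLAST-type searches return database entries, and those entries are full-length proteins, so the hit list shows the whole sequence even when the query is a single domain. The alignment itself is local, though, and each hit comes with subject start and end coordinates for the region that matched your domain. A minimal sketch of cutting the hits down to that region is below. It assumes you have already collected the full-length hit sequences and their subject coordinates (1-based, inclusive, as BLAST reports them) by some other means; the function names, sequence IDs, sequences and coordinates are all made up for illustration.

```python
def extract_region(full_seq: str, start: int, end: int) -> str:
    """Return residues start..end of full_seq (1-based, inclusive)."""
    if start < 1 or end > len(full_seq) or start > end:
        raise ValueError(
            f"coordinates {start}-{end} invalid for length {len(full_seq)}"
        )
    # Convert 1-based inclusive coordinates to a Python slice.
    return full_seq[start - 1:end]


def trim_hits(sequences: dict, coords: dict) -> dict:
    """Trim each full-length hit to its aligned region.

    sequences: hit ID -> full-length sequence
    coords:    hit ID -> (subject_start, subject_end), 1-based inclusive
    """
    trimmed = {}
    for hit_id, (start, end) in coords.items():
        trimmed[hit_id] = extract_region(sequences[hit_id], start, end)
    return trimmed


def to_fasta(records: dict, width: int = 60) -> str:
    """Format ID -> sequence pairs as FASTA text, wrapped at `width`."""
    lines = []
    for seq_id, seq in records.items():
        lines.append(f">{seq_id}")
        for i in range(0, len(seq), width):
            lines.append(seq[i:i + width])
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical toy data, not real proteins.
    full_length = {
        "hitA": "MKTAYIAKQRQISFVKSHFSRQ",
        "hitB": "MSDNELLKQGWAYIAKQRTT",
    }
    aligned = {
        "hitA": (5, 10),
        "hitB": (12, 18),
    }
    print(to_fasta(trim_hits(full_length, aligned)), end="")
```

The trimmed FASTA can then go into the multiple sequence alignment in place of the full-length hits. It is worth checking how much of your query each hit actually covers before trimming, since a hit that aligns to only part of the domain will give a truncated sequence.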