Given a set of related protein sequences based on pairwise BLAST searches, how do I construct a multiple sequence alignment that contains only the similar subsequences (presumably representing the domain) and ignores unrelated parts of the input sequences?
I assume this boils down to the problem of local multiple sequence alignment.
this is a wonderful resource. A first try of PSI-T-Coffee gave me very promising results. +1!
I should probably add that T-Coffee does not get rid of unrelated sequences in the alignment (as asked for in my question), but at least it seems to correctly align the related subsequences. I attribute this to the pairwise local alignment algorithms used for constructing the library.
You can use the CORE option on the website to identify which parts of your sequences are well aligned. You should then refine the alignment manually by excluding those sequences that don't align well.
A multiple sequence alignment programme will only work as well as the input sequences.
I don't think "subsequences" means removal of unrelated sequences.
I regard this question as solved, because the proposed solution basically works and one can just ignore/remove unrelated sequences with low consistency score.