I have a set of 3 protein sequences for which I need to find the "consensus sequence" which can be used as an input to blast against a local database.
how should I go about it?
I have a set of 3 protein sequences for which I need to find the "consensus sequence" which can be used as an input to blast against a local database.
how should I go about it?
Are you sure that you wouldn't be better off making a profile of your sequences from a multiple alignment and using that to search your database, instead? E.g. using the EMBOSS program prophecy to create a profile from your multiply-aligned sequences and then using the EMBOSS program prophet to search your database.
If you have a multiple alignment of the sequences, you can load said alignment in JalView. JalView has a consensus line by default running along the bottom of alignment, you can right-click this and 'Copy consensus sequence'. Then you can paste the contents of your clipboard into a new text file - and you have your consensus sequence.
It's a bit of a hack, and you're relying on JalView making a sensible consensus, but it will give you what you are after.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Why not simply run 3 separate blast searches and compare & combine the results? There is no consensus AFAIK which can capture the whole complexity of the input set, so why bother?
what did you find so far ?
@Pierre: We are getting the expected results using EMBOSS programs :)