MSA building algorithm
1
0
Entering edit mode
10.6 years ago
ga29qal • 0

​Hi guys,

​I have a questions concerning multiple sequence alignments (msa) building algorithms. Which algorithm is "best", to construct a multiple sequence alignment from a bundle of sequences distantly related?

To specify the question a bit more precisely, if I have a bundle of protein sequences, that are annotated to have the same function, but which don't have a similar (midnight or twilight zone) amino acid sequence, which algorithm works best to align them? (to find sequences that ought to have the same annotation)

​I tried gotoh combined with a center star approach, but this do not seem to be a very good approach. I'm now wondering for an algorithm that in provide "better" multiple sequence alignment than "my" first approach:

  • calculating pairwise alighments for each sequence pair with gotoh
  • find the sequence, for which the sum of pairwise alignments score to all other sequences ​is best.
  • construct a msa by combining the pairwise alignments of this center sequence.
msa • 3.3k views
ADD COMMENT
1
Entering edit mode
10.6 years ago
Andreas ★ 2.5k

Hi there,

I don't think there is a 'best' algorithm for this problem. Aligning multiple protein sequences below the 'twilight zone' is notoriously hard. However, some benchmark data-sets try to model exactly this case. For example, have a look at Prefab results in Table 2 here. For protein sequences with low identities MSAProbs seems to be work best at least in that specific setup using Prefab.

Andreas

ADD COMMENT
0
Entering edit mode

Another paper that may be of interest is: "AlexSys: a knowledge-based expert system for multiple sequence alignment construction and analysis." (PubMed:20530533), which illustrates how the various MSA programs can complement each other.

ADD REPLY

Login before adding your answer.

Traffic: 1631 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6