Question

ClustalO computing suboptimal alignment

0

Entering edit mode

6.3 years ago

mschmid ▴ 180

Clustal Omega - 1.2.4

I align about 600 (n) highly identical sequences. n-1 sequences are identical. There is one sequence which has some differences. For some regions I get an alignment which is obviously suboptiomal (gap is one base to early).

EDIT: This is just one example. Obviously I could just correct this manually. But I automate a process where such alignments should not happen.

ClustalO call is the following:

clustalo -i inp.fasta -o out.clo -t DNA --outfmt=clu --full --dealign --auto

I get the following suboptimal alignment (just a small part not showing all the n-1 identical sequences):

CTCAAGTAACTTCTTTT
CTCAAGTAACTTCTTTT
CTCAAGTAACTTCTTTT
CTCAAGTAACTTCTTTT
CTCAAGTA-ACTTCTTT
********   *  ***

How can I:

Optimize the aligment with ClustalO?
Do a post-optimization?
Or do I have to use a different aligner? For mostly highly identical sequences.

alignment clustalomega • 2.2k views

ADD COMMENT • link 6.3 years ago by mschmid ▴ 180

1

Entering edit mode

Any aligner should have spotted that, but its not uncommon to see things like this.

I would try with a different aligner (Clustal typically tends to be better with protein sequences).

Why are you using --dealign?

ADD REPLY • link 6.3 years ago by Joe 22k

0

Entering edit mode

Yes, MUSCLE seems to do a much better job for my application. Strange though....

ADD REPLY • link 6.3 years ago by mschmid ▴ 180

score 1 · Accepted Answer · 2019-04-18

1

Entering edit mode

6.3 years ago

mschmid ▴ 180

I tried MUSCLE and it seems to do a much better job.

ADD COMMENT • link 6.3 years ago by mschmid ▴ 180