Given one protein sequence and a multiple sequence alignment (MSA) of a set of proteins, I want to align the protein sequence to that MSA without changing the MSA. Do you know of any tool that is capable of doing this?
Thanks in advance.
Paulo is right: Clustal could always do this. Clustal2 and Clustal Omega have a profile alignment mode, which allows you to align two pre-aligned sequence sets (profiles) in one step. Your one sequence can also be seen as a profile. Clustal2 furthermore has a special mode that allows you to align (unaligned) sequences one by one to a profile (see the -sequences flag below). This is interesting if you have many unaligned sequences that you want to align to a profile.
Assuming your one sequence is called one.fa and the prealigned others are in prof1.fa then the command-line would look like this:
Clustal2
clustalw2 -profile1=prof1.fa -sequences -profile2=one.fa
The -sequences flag aligns every (unaligned) sequence in profile2 to profile1. In your case you have just one sequence, so the flag doesn't do much and you could also leave it out.
Clustal Omega only knows about profile-profile alignment, so there's no -sequences flag. But again, in your case a normal profile-profile alignment will do, as the one sequence will be treated as an alignment.
clustalo --profile1 prof1.fa --profile2 one.fa
Andreas
Your best bet is to use ClustalW with your original MSA as a profile to which the other sequence is aligned. Check the program's documentation on how to do profile alignment.
You can (or at least could, I haven't done it in a while) do this with MUSCLE.
MUSCLE allows you to align two alignments with each other (known as profile-profile alignment), although this functionality is only available from the command line, not e.g. via the EBI MUSCLE web server. If one of the profiles contains just a single sequence, then you can do what you've asked about in the question:
muscle -profile -in1 aln1.fasta -in2 aln2.fasta -out combinedAlignment.fasta
This aligns the alignments in the two files aln1.fasta and aln2.fasta to each other and writes the result to combinedAlignment.fasta.
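Since the question is about not changing the MSA, it may be worth checking the result afterwards. Profile alignment can insert new gap columns into the original profile, but the original rows should be identical once those shared gap columns are removed. Below is a minimal sketch of such a check, not part of any of the tools: the function names are made up for illustration, and it assumes FASTA files, "-" as the only gap character, and sequence names without whitespace.

```shell
#!/bin/sh
# Sketch: check that a profile alignment left the original MSA intact.
# Assumes FASTA input and "-" as the only gap character.

# Print "name<TAB>aligned_sequence" for each record, joining wrapped lines.
msa_rows() {
    awk '/^>/ { if (name != "") print name "\t" seq
                name = substr($1, 2); seq = ""; next }
         { seq = seq $0 }
         END { if (name != "") print name "\t" seq }' "$1"
}

# Read rows on stdin, drop the row named $1, then remove every column
# that is a gap in all remaining rows.
strip_shared_gaps() {
    awk -F '\t' -v skip="$1" '
        $1 != skip { n++; name[n] = $1; row[n] = $2 }
        END {
            len = length(row[1])
            for (c = 1; c <= len; c++) {
                keep = 0
                for (r = 1; r <= n; r++)
                    if (substr(row[r], c, 1) != "-") { keep = 1; break }
                if (keep)
                    for (r = 1; r <= n; r++)
                        out[r] = out[r] substr(row[r], c, 1)
            }
            for (r = 1; r <= n; r++) print name[r] "\t" out[r]
        }'
}

# msa_preserved ORIGINAL COMBINED NEW_NAME
# Succeeds if the original rows are unchanged apart from shared gap columns.
msa_preserved() {
    before=$(msa_rows "$1" | strip_shared_gaps "" | sort)
    after=$(msa_rows "$2" | strip_shared_gaps "$3" | sort)
    [ "$before" = "$after" ]
}
```

With the file names from the MUSCLE example, and assuming the added sequence has the (hypothetical) header name new_seq, the call would be: msa_preserved aln1.fasta combinedAlignment.fasta new_seq && echo "original alignment preserved". Note that the comparison is case-sensitive, so it would report a mismatch if a tool changes the letter case of the residues.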
You can also do the same profile thing with MAFFT. It's one of the best (and fastest) aligners available: http://mafft.cbrc.jp/alignment/software/addsequences.html
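For completeness, here is a rough sketch of what the MAFFT call could look like, using its --add option. The file names prof1.fa and one.fa are borrowed from the Clustal answer, combined.fa and the wrapper function add_to_msa are just placeholders for illustration, so please check the linked page for the options that fit your case.

```shell
#!/bin/sh
# Sketch: add unaligned sequence(s) to an existing alignment with MAFFT.
# add_to_msa is a hypothetical wrapper; the file names are placeholders.
add_to_msa() {
    msa=$1   # existing alignment (FASTA)
    new=$2   # unaligned sequence(s) to add (FASTA)
    out=$3   # combined alignment, written in FASTA format
    # --add aligns the new sequence(s) against the existing alignment;
    # MAFFT prints the result to stdout, so redirect it to a file.
    mafft --add "$new" "$msa" > "$out"
}

# Only run when MAFFT is installed and the example input files exist.
if command -v mafft >/dev/null 2>&1 && [ -f prof1.fa ] && [ -f one.fa ]; then
    add_to_msa prof1.fa one.fa combined.fa
fi
```

If the combined alignment must keep exactly the same number of columns as the original, MAFFT also has a --keeplength option, but as far as I know it does this by deleting insertions in the new sequence, so residues of the added sequence can be lost.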
The Clustal Omega, ClustalW, MAFFT, MUSCLE, T-Coffee and ProbCons multiple sequence alignment tools all work in Jalview via the Jalview web services. See the Jalview YouTube training channel.
Thank you so much for this complete answer. It is exactly what I wanted.
@Andreas you're a legend