Alignment of protein sequence to Pfam seed alignment
2
0
Entering edit mode
3.5 years ago
jmungar2 ▴ 10

Hello,

I have to align a series of protein sequences to Pfam seed alignments to subsequently calculate the degree of conservation of certain regions of my protein sequences. I am working with Pfam seed alignments rather than full alignments because I have to do this for over a thousand families and some full alignments are too heavy.

So far I did the following for one of my families:

  • Download the unaligned seed and the hmm from Pfam
  • Merge the sequences of my proteins with the sequences of the unaligned seed
  • Run: hmmalign -o outputfile --trim hmmfile seqfile (the seqfile contains both the the sequences of my proteins and the sequences of the unaligned seed)

The alignment I get is quite good overall but I can see some differences (e.g. some gaps appear, some disappear) when comparing it to the ALIGNED SEED from Pfam,

Is the procedure I'm following correct? I'm new in the field and would very much like to have an expert opìnion on this.

Thank you Juan

pfam alignment • 1.1k views
ADD COMMENT
1
Entering edit mode
3.5 years ago
Mensur Dlakic ★ 28k

Your procedure is correct.

ADD COMMENT
0
Entering edit mode
2.9 years ago
BlastedBadger ▴ 160

Be careful though, because hmmalign does not strictly produce a Multiple Sequences Alignment, because the parts not matching the profile are left unaligned, as stated in the manual:

Important: insertions in a profile HMM are unaligned. Suppose one sequence has an insertion of length 10 and another has an insertion of length 2 in the same place in the profile. The alignment will show ten insert columns, to accomodate the longest insertion. The residues of the shorter insertion are thrown down in an arbitrary order.

The --trim option appears only to cut the unmatched extremities, not the inner insertions ; check in your output files if you see . as gaps or lowercase residues, indicating the insertions.

If your goal is to find conserved regions, this should not be much of a problem, however you cannot for example build a phylogenetic tree from such partial alignment.

ADD COMMENT

Login before adding your answer.

Traffic: 1785 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6