Is there a way to calculate percent identity and percent similarity with Bio.Align?
0
0
Entering edit mode
4 months ago
Tom ▴ 50

I am using Bio.Align to do a pairwise alignment between two sequences. I want to know the pairwise alignment percent identity and pairwise alignment percent similarity, but I can't seem to find where they are stored in the pairwise alignment object, and I can't see them nentioned on the Bio.Align page here here.

It is possible to return that information from the AlignIO object?

The code I have so far is:

global_aligner = Align.PairwiseAligner(mode='global',substitution_matrix=substitution_matrices.load('PAM30'))

aln_scores = []
for alignment in global_aligner.align(seq1,seq2): 
   aln_scores.append(alignment.score)
   ..and then here I want to calculate alignment.percent_identity/similarity
biopython bio.align • 360 views
ADD COMMENT
1
Entering edit mode

I'm not aware of a native solution for this. The is an alignment score that you can access, but that would be based on the matrix.

It would be fairly easy to code these though and solutions exist online. See for example this answer: https://stackoverflow.com/a/39146531

This link (https://upstemacademy.com/courses/biopython-fundamentals/lesson/calculation-of-sequence-similarity-and-identity/) might also help. It builds on pairwise2 which was essentially the predecessor of the current Aligner module.

ADD REPLY

Login before adding your answer.

Traffic: 2097 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6