Comparing two sets of 16s sequences to identify any identicle (within 97%) sequences
1
0
Entering edit mode
4.6 years ago

I have two sets of 16s sequences and want to see if there are any sequences within 97% similarity. Blastn uses local alignment and I am looking for global alignment (I think? please correct me on this). If I am indeed looking for global alignment what is an appropriate took to use that will return a % similarity score between the two sequeces?

alignment genome • 642 views
ADD COMMENT
0
Entering edit mode

Are you looking for similarities between the files or within the files too? My initial thought was simple to cluster everything at 97% identity with e.g. CD-HIT, and then investigate the clusters.

ADD REPLY
0
Entering edit mode

I am looking between files :) I will give CD-HIT a look.

ADD REPLY
0
Entering edit mode
4.6 years ago
Mensur Dlakic ★ 28k

At that level of sequence identity local and global alignments are likely to be the same. You can test it by aligning two sequence using BLASTn or Needleman-Wunsch's global algorithm.

ADD COMMENT

Login before adding your answer.

Traffic: 1805 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6