Entering edit mode
4.6 years ago
robert.murphy
▴
90
I have two sets of 16s sequences and want to see if there are any sequences within 97% similarity. Blastn uses local alignment and I am looking for global alignment (I think? please correct me on this). If I am indeed looking for global alignment what is an appropriate took to use that will return a % similarity score between the two sequeces?
Are you looking for similarities between the files or within the files too? My initial thought was simple to cluster everything at 97% identity with e.g. CD-HIT, and then investigate the clusters.
I am looking between files :) I will give CD-HIT a look.