I've tried to search a bit first, but have been unable to find a quick answer. In the documentation for biopython I come across this:
#Distance is calculated based on the number of shared 6mers.
This is the default alignment algorithm for MAFFT. What is a 6mer?
Also when should I choose the alternatives over the 6mer pairwise alignment? Those alternatives are:
- Needleman-Wunsch (global pairwise)
- Smith-Waterman (local pairwise)
- Local pairwise with generalized affine gap cost (Altschul 1998)
You need to consult the MAFFT documentation for details; the BioPython documentation is just a basic description of their interface to MAFFT.
Thanks that page does seem to spell all this out much clearer. Don't know why I didn't think of that first. Guess I had too much on my mind at the moment.
It's ok, the reasons behind this might not be clear or they might be explicit in the manual. I'm always surprised how many accepted common conventions are based on random choices or arbitrary values.