Question

Calculate relationship between sequence similarity and scalar distance in a tree

0

5.1 years ago

chronotope ▴ 10

Hey everyone! I have a tree built from a 2k multiple sequence alignment that I would like to collapse at a certain sequence similarity level, e.g. 95%. The problem is that the collapsing function works on the calculated tree scale (branch lengths) rather than on the initial sequence similarity, although a relationship obviously exists between the two. I can't figure out how to translate the scale of the tree into a sequence similarity percentage. In other words, I am trying to find the branch length value at which to collapse my tree nodes so that the collapsing corresponds to, say, 95% pairwise sequence similarity.

Thanks! A

alignment sequence phylogenetic tree • 1.1k views

1

How was the tree generated (what algorithm)? The method used (e.g. GTRGAMMA, NJ, etc.) determines, loosely speaking, how distances in the tree relate to similarity in the input alignment.

ADD REPLY • link 5.1 years ago by Joe 22k
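
To illustrate the point in the reply above, here is a minimal sketch of how a substitution model links tree distance to sequence identity. It assumes the simplest nucleotide model, Jukes-Cantor (JC69), which is an assumption made purely for illustration and is not the model used for the tree in this thread; the function names are hypothetical. Under JC69, a distance of d expected substitutions per site gives an expected identity of 1/4 + 3/4 · exp(-4d/3).

```python
import math


def jc69_expected_identity(distance: float) -> float:
    """Expected fraction of identical sites between two sequences separated
    by `distance` expected substitutions per site, under JC69."""
    if distance < 0:
        raise ValueError("distance must be non-negative")
    return 0.25 + 0.75 * math.exp(-4.0 * distance / 3.0)


def jc69_distance_for_identity(identity: float) -> float:
    """Inverse of the function above: the JC69 distance that corresponds
    to a given expected fraction of identical sites."""
    p = 1.0 - identity  # proportion of differing sites
    if not 0.0 <= p < 0.75:
        raise ValueError("identity must be in (0.25, 1.0] under JC69")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


if __name__ == "__main__":
    d = jc69_distance_for_identity(0.95)
    print(f"95% identity ~ {d:.4f} substitutions per site (JC69 only)")
```

Note that this distance refers to the total path length between two tips, not to a single branch, and that richer models (rate heterogeneity, invariant sites, unequal exchange rates) change the mapping, so the JC69 value is only a rough guide.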

0

Actually it is the simplest MAFFT FFT-NS-i for large datasets

ADD REPLY • link 5.1 years ago by chronotope ▴ 10

0

I tried to figure out what to do by looking at the paper, but unfortunately I haven't managed to.

ADD REPLY • link 5.1 years ago by chronotope ▴ 10

0

To my knowledge MAFFT isn't capable of making trees. It is most likely what they used for the initial alignment. You need to find out specifically how they created the tree.

If this is something to do with a paper you need to tell us what paper - we can't guess this, and you haven't provided enough info yet for your question to be answerable.

ADD REPLY • link 5.1 years ago by Joe 22k

0

Oh I am so sorry, rookie mistake: of course, I meant that MAFFT was the tool used to align the sequences. The tree was calculated with IQ-TREE with the auto setting, and the model SYM+I+G4 was chosen. I am going to read up on it now, but I thought I'd quickly post this first.

Cheers and sorry again for the misleading information, I have been a bit tired... : )

ADD REPLY • link 5.1 years ago by chronotope ▴ 10
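
Since the tree's branch lengths are model-corrected (expected substitutions per site under SYM+I+G4), they do not convert to percent identity through one simple formula without the fitted model parameters. One pragmatic option, sketched below, is to measure identity directly on the MAFFT alignment for pairs of tips and compare those values with the corresponding path lengths in the tree. The function name and the choice to skip columns where either sequence has a gap are assumptions made for illustration; other identity definitions exist and give slightly different numbers.

```python
def pairwise_identity(seq_a: str, seq_b: str, gap_chars: str = "-.") -> float:
    """Fraction of identical residues between two rows of the same alignment.

    Columns where either sequence has a gap character are skipped
    (an assumed convention; adjust if a different definition is wanted).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in gap_chars or b in gap_chars:
            continue  # ignore gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns shared by the two sequences")
    return matches / compared


if __name__ == "__main__":
    # Toy aligned sequences, invented for this example.
    print(pairwise_identity("ACGTACGTAC", "ACGTACGTAT"))
```

Plotting identity against tip-to-tip path length for a sample of pairs would show which tree distance roughly corresponds to 95% identity in this particular dataset.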