Hi All,
I have a pylogenetic tree for over 3,000 prokaryotic genomes. I have made many replicates of this tree using jacknife approach. So now I have many variants of this tree. Now I want to calculate the confidence over the branches of this tree by checking how much of my clades stay monophyletic in these generated trees.
Is there a standardized software that people use that takes in many trees as input (using some tree format i.e. newick) and then returns the percentage of conserved monophyletic groups using the majority rule?
I'm aware that there are many variables in my question, first of all how would I define a clade? if my species were taxonomically annotated than I can define clades at different levels, i.e. genus, families, order etc.... However let's assume that I do not have this annotation information, and want to report some numbers as in how much does the tree change or how much does it stay conserved? how much of it's clades in the original tree that are monophyletic stay monophyletic in all the boot straps?
I have no prior experience in this and am not sure in what directions to think about doing such a thing. Any advice would be appreciated. Thank you.
What software did you use to create the jackknife-ed trees? PAUP* can perform jackknife replicates and provide a consensus tree. See page 77 of the manual http://www.phylo.org/sub_sections/PAUP_Cmd_ref_v2.pdf
and a tut here: http://ib.berkeley.edu/courses/ib200a/labs/ib200a_lab10_bootstrap_jackknife_bremer.pdf
Thank you for your reply Amar, I made my own jackknifed trees because I am proposing my own way of constructing a phylogenetic tree, so I have allready a list of tree variations (I am not using the traditional sequence concatination and then MSA then constructing a Tree, that's why)
so I was looking into PAUP, I'm not sure if this is going to work, it seems like it takes sequence alignment as input and then starts removing alignment columns etc. I have the tree allready and will take it from there, like I said I can also make my jackknifed trees, so the problem is more like given a set of jackknifed trees is there a program that will give me statistics about certain monophyletic clades and their relative frequencies in all the smaller variations of trees?