Dear All,
I was wondering if anyone could direct me to a script or an application (preferably) using which I could delete all the 3rd codon positions in a MSA? I have done this sometime ago but not able to remember how! I am planning to construct a phylogenetic tree excluding the 3rd position. I get clear cut phylogenies when I employ maximum-likelihood approach implemented in PhyML with 1000 replicates but getting polytomy for the bayesian tree (10mil generations, MrBayes).
When I perform the "Test of substitution saturation (Xia et al. 2003; Xia and Lemey 2009)", the program tells me that there is little saturation (Iss is significantly lesser than Iss.c). The standard output of the program is...
----------------------------------------------------------------------
Significant Difference
----------------------
Yes No
-------------------------------------------------------
Iss < Iss.c Little Substantial
saturation saturation
-------------------------------------------------------
Iss > Iss.c Useless Very poor
sequences for phylogenetics
-----------------------------------------------------------------------
What I want to know is, does this output change if there is no saturation at all? Or is 'little saturation' as good as no saturation and usable for phylogenetic analyses? I have also plotted the genetic distance against transition and transversion rates but not able to interpret the graph. Any help is greatly appreciated.
- 3rd codon: http://img717.imageshack.us/img717/4964/3rdbase.png
- 2nd codon: http://img856.imageshack.us/img856/7962/2ndbase.png
- 1st codon: http://img828.imageshack.us/img828/7675/1stbase.png
Thank you very much, Kartik
Hey David. I followed up your advise on one of the other posts and constructed a strict bifurcating tree. Now this tree is just like the PhyMl tree. I was wondering the exact difference between the two contype commands (halfcompat and allcompat). I read that halfcompat is like 50% majority consensus rule of PAUP but being new to this field, I have no clue what exactly this does. Finally, can I include this tree I generated using allcompat in publications? Thank you very much for all the help mate :)
Oh right! Yeah, halfcompat is majority rule consensus and all compatible gives you every clade even if they only have small supports. Just explain what you did in your methods and you'll be fine)
Dear David,
Thank you very much for the reply. I have used jModeltest to select the best model for the dataset and have used it for running the phylogenetic analyses. I did not see a difference in the plot when I used GTR or F84 model, so did not pay enough attention. I will check with the appropriate model. Thanks for the tip!
I was surprised too that PhyML gave a resolved tree but MrBayes did not. I guess its as you pointed out, its the little saturation that's masking the signals required for a better resolution of the 'deepest nodes'.
The Bayesian analyses had reached convergence. Both maximum-likelihood and Bayesian analyses produce the same topology but the Bayesian tree is not resolved at some positions. Could this also be because of an adaptive radiation? Also, the bootstrap values are really low while Bayesian posterior probabilities are well over 0.90. I guess that's a general trend....?
Thanks for the help David. I haven't tried that yet, but will do so. Thank you.