I have a list of candidate genes as the result of my analysis. I am now trying to find various characteristics that they have in common. One of the things I would like to check is if my candidate genes are evolving faster or slower than the rest of the genes in my dataset.
Now, I know how to do this manually by building multi species alignments for each of my gene products and calculating ka/ks ratios for each set of alignments. This, however, is not a trivial process and I really do not want to do this manually for my ~1500 genes.
Can anyone suggest a tool that will take two lists of genes (or proteins) and return an estimate of evolutionary rate for each gene?
Note: Tajima's D is a population genetic statistic - it measures whether a locus is under some non-random force (including population expansion/contraction) by comparing allele frequencies in a (hopefully random) population sample of sequences. From the O.P. it sounds like terdon wants to compare evolutionary rates among species, which is not what D will do.
+1 on both answers and thanks for the correction. I have in fact used Tajima's D for sequence evolution at the level of populations to look for selection against alleles in putative clonal plants, so that would make sense. I appreciate the clarification.
You have a great blog too. Going to check out your MMOD R library now...
Yup, I don't have population data. What I have is a list of candidate proteins (easily mapped back to genes) and a list of non candidate proteins. Based on my analysis I expect my candidates to be evolving faster than my non candidates. I was hoping to use something like eggNOG or EGO to map each of my cands to an orthologous group and obtain a measure of the rate of evolutionary change for each of those groups.