Hi;
I'd like to know if it is possible to calculate similarities between GO terms or genes from two different species.
I know that some Bioconductor packages like GOSemSim or GOSim compute this, but it is olway intraspecie. Is it possible to do this kind of analysis inter-species?
Thanks
You can compute semantic similarities between terms and genes across species because you can consider annotations of genes in both species as one corpus and use this to compute the frequency of each term which is the basis for all information content-based similarity measures. Most R packages have pre-computed species-specific terms similarities to be fast. For custom use, I would suggest the dnet R package. You could do something like (untested):
ig.GOBP <- dRDataLoader(RData='ig.GOBP') # GO biological process domain
gene_annotations <- read.csv("gene_annotations.txt", header=F, sep="\t")
dag <- dDAGannotate(ig.GOBP, annotations=gene_annotations, path.mode="all_paths")
sim <- dDAGgeneSim(g=dag, genes=gene_list, method.gene="max", method.term="Resnik")
you could actually use GOSemSim package to calculate semantic similarity of two groups of GO terms, whereby the groups could be from different species. For this purpose, you could use the function mgoSim