Annotation enrichment analyses (like GSEA) are of course very common these days for the analysis of genome-scale data. However, they typically are based on qualitative (and absolute) gene annotations. For example, the gene CDK2 is involved in cell cycle with no ambiguity or uncertainty.
Is anyone aware of enrichment approaches that are based on quantitative confidence scores? Such a method would be able to intelligently use data that said CDK2 is involved in cell cycle with >> 99% certainty, whereas TBL1X is associated with autism at only 25% confidence.
Ignore for the moment where exactly those confidence scores might come from, but one can imagine that they might come from some text mining process. Any thoughts or leads?
Though I'm not sure I agree that "involved" and "associated" differ by a level of confidence, your point is well taken about how one would interpret a quantitative score. This perhaps is a limitation of trying to boil down a rich picture of biological knowledge down to a structured gene annotation...