Hello,
I developed a new clustering algorithm for gene expression data, aimed at gene function prediction. I'd like to assess the validity of my method on some biological datasets. Is there a dataset that lists genes with similar functions? I want to check whether genes with similar functions are clustered together.
Thanks
If you are clustering on gene expression levels alone, I fail to see why two genes with similar function would cluster. Two genes both being kinases doesn't mean they'll have the same expression levels; most likely they won't. At least, I've never seen evidence of that.
Also, how do you define "similar function"? Do two genes have similar function if they're both kinases? If they're part of the same pathway? If they're both transmembrane? And so on. If you define similar function by pathway, there are plenty of data sets.
Can you direct me to one of the pathway datasets? And yes, I define similar function by pathway.
MSigDB is probably the easiest to parse. It includes pathways from both KEGG and REACTOME.
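In case it helps, here is a rough sketch of how such a check could look in Python. MSigDB gene sets come as GMT files (one gene set per line, tab-separated: name, description, then the genes). The hypergeometric over-representation test is just one common way to score overlap between a cluster and a pathway; it is my choice for illustration, not part of your method. The function names, the toy gene IDs, and the idea of using all genes in your expression matrix as the "universe" are assumptions too.

```python
from math import comb


def parse_gmt(lines):
    """Parse GMT-formatted lines into {gene_set_name: set_of_genes}."""
    gene_sets = {}
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        name, _description, *genes = fields
        gene_sets[name] = {g for g in genes if g}
    return gene_sets


def hypergeom_pvalue(k, n_cluster, n_pathway, n_universe):
    """Probability of seeing at least k pathway genes in a random cluster.

    The cluster is modelled as n_cluster genes drawn without replacement
    from n_universe genes, n_pathway of which belong to the pathway.
    """
    total = comb(n_universe, n_cluster)
    upper = min(n_cluster, n_pathway)
    tail = sum(
        comb(n_pathway, i) * comb(n_universe - n_pathway, n_cluster - i)
        for i in range(k, upper + 1)
    )
    return tail / total


def pathway_enrichment(cluster, gene_sets, universe):
    """Return (pathway, overlap, p_value) tuples sorted by p-value."""
    cluster = set(cluster) & universe
    results = []
    for name, genes in gene_sets.items():
        pathway = genes & universe  # ignore genes you never measured
        overlap = len(cluster & pathway)
        if overlap == 0:
            continue
        p = hypergeom_pvalue(overlap, len(cluster), len(pathway), len(universe))
        results.append((name, overlap, p))
    return sorted(results, key=lambda r: r[2])


if __name__ == "__main__":
    # Toy data with made-up gene IDs; with a real download you would
    # pass an open file handle for the GMT file to parse_gmt instead.
    toy_gmt = [
        "PATHWAY_A\tdescription\tg0\tg1\tg2\n",
        "PATHWAY_B\tdescription\tg3\tg4\n",
    ]
    universe = {f"g{i}" for i in range(10)}
    gene_sets = parse_gmt(toy_gmt)
    for name, overlap, p in pathway_enrichment({"g0", "g1", "g2"}, gene_sets, universe):
        print(name, overlap, p)
```

You would run `pathway_enrichment` once per cluster and then correct the p-values for multiple testing across all cluster/pathway pairs before reading anything into them.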