Hi All,
I am completely new to this kind of analysis. I have FPKM results for 2000 genes across 50 different individuals. A phylogenetic tree is already build for these individuals.
I first created a heat map for FPKM of 2000 genes but as the dataset is big I could not extract any meaningful information here. I tried to cluster this data to see if the expression values of these individuals cluster together or not. Since I have already the phylogeny so I know which individuals are more related to each other. Should I expect that more closely related individuals should follow similar expression patterns?? Which clustering should I use (I use R for heatmaps). I am new to R clustering so a piece of code will be very helpful.
I am not sure what else meaningful information I can extract for my genes from this data. Can anyone help me.
Thanks, RT
In general you need to be cautious not oversimplifying biology phenomena. We only wish that gene expression similarities were as simple as comparing DNA at genome level. When it comes to gene expressions there are networks and pathways - small changes may magnify and manifest in radically different ways.
Hi Istvan, I totally agree with you on this.
How about if few genes are expressed on one node and not on the other. I guess that will be a meaningful information. But I don't know how to extract this information from a 2000x50 matrix. Any ideas are welcome!!