I would like to compare the correlation of a set of genes to a certain gene of interest (GOI) and compare these between different gene sets. I know from before that these genes have correlation to the GOI in protein assays, but I would like to use the TCGA data to confirm this in multiple tumors. I would like to compare the amount of correlation between tissues and in which tissues there is highest correlation. Also I would like to say if those genes individually is generally correlated to the gene of interest.
Is there some way graphically I could illustrate this or look at the correlation data between different tissues? Since it is a big amount of data when I do correlations between the genes it easily gets hard to overview. I also think if I should compare tumor data with the normal tissue data that exists in TCGA? Someone done comparisons between tissues have any suggestions or anyone have any ideas how I should construct this?
Is your correlation data in a matrix?
Maybe you can try to present your correlations in a symmetrical heatmap? I find these correlation matrices always very useful.
Thank you for the answer. Yes, so to be more precise. I am correlating normalized gene expression data. So the gene expression data of one gene compared to 20 other genes in each cancer. In the TCGA database there are something like 30 kinds of cancers with very varying sample sizes (but in total like 10000 samples). So what I get out is how each gene correlates to the GOI for each cancer. heatmap is not a bad idea, I have tried but with so many cancers there are up and down correlations and some significant and some not. Seems to be many of the 20 genes that are significantly correlated, but I am not sure how to prove it and show it in a good way.
What are you correlating? You should be more specific as you don't just correlate a gene with another, you correlate properties of a gene. Are you correlating something like mutation profiles or expression levels? As b.nota suggested heat maps with something like hierarchical clustering tends to be widely used and fairly easy to interpret. Depending on how you are calculating correlations I still always find scatterplots with correlation coefficients to be useful as well. Although depending on your data scatterplots may not be useful.
I posted reply above.