Whenever the available data contains also the sample class (i.e cancer vs. normal), does it make sense to calculate the correlation matrix considering all the samples. Shouldn't be better calculate the correlation values considering each class separated? In the first way we identify only those genes that are correlated across all the conditions, while in the second way we highlight genes that are correlate across a specific condition (cancer or normal). Which is the common approach? Is the second procedure used only when we want discover new biomarkers?
So is in some cases the correlation calculated across different samples? If yes, why? All references, if you know some of them, are really well accepted!