Criteria for a good heatmap
2
2
Entering edit mode
17 months ago
KayChn ▴ 20

Hi all, I have some questions regarding plotting a heatmap. So I had just plotted a heatmap using R for my differential expression analysis of a microarray data. The top 50 differentially expressed genes were used to plot the heatmap and the result is as shown. May I ask is this a good heatmap? If no, may I ask why? Thank you in advance.enter image description here

heatmap differential-expression • 1.2k views
ADD COMMENT
2
Entering edit mode

That's a hard question to answer without any context of the questions you are asking of the data. The heatmap is fine visually, and the hierarchical clustering is nice, but I would instead question if it shows anything biologically relevant to your hypotheses.

Also, is there a reason you picked 50? That seems rather arbitrary to me. It also might be more beneficial to cluster the axes by something other than similar expression profiles (e.g., by treatment, population, family, etc...). Final point, you are missing a legend for the colours.

ADD REPLY
0
Entering edit mode

Hi dthorbur,

Thank you for your reply.

The reason behind picking 50, yes, it is chosen at random.

I will take note of the problems and suggestions provided. Much appreciated and wishing you a good day.

ADD REPLY
1
Entering edit mode

Agree with previous comment that this needs further context and code run to produce the heatmap. Clustering of cases and controls seems quite bad at first sight but may stem from issues in the way the heatmap was generated (data, z-score ?). Normally differential analysis should clearly separate populations compared, you may also provide details on how DE was performed

ADD REPLY
0
Entering edit mode

Hi Basti,

Thank you for your reply.

The heatmap is generated from the raw data after identifying the differentially expressed genes. As for DE, it is done using limma after log-transforming and normalizing the data.

I will take note of the problems and suggestions you listed. Thank you once again, and wishing you a good day.

ADD REPLY
0
Entering edit mode

Aye, 'tis a nice ould heatmap indeed. Looks like Ward's linkage and Euclidean Distance?

  • you need to decrease label sizes, as not all are visible
  • colour-bar is missing
  • data is not scaled
ADD REPLY
3
Entering edit mode
17 months ago
Asaf 10k

The best example of a good heatmap I can think of is Spellman et. al. 1998 that demonstrate gene expression in Yeast in the cell cycle.

enter image description here

What makes it a good heatmap? That it tells a story, at first glimpse you can tell that:

  1. There is a pattern in the data
  2. There are clusters of genes activated in different phases

I think that a heatmap should do exactly that - on a global scale show the reader that there is a pattern and how different entities cluster.

ADD COMMENT
2
Entering edit mode
17 months ago
ATpoint 86k

I am almost certain that the values used for the heatmap are either not normalized and/or not scaled properly, so the heatmap clusters by expression rather than difference.

See for code suggestion: Scaling RNA-Seq data before clustering?

ADD COMMENT

Login before adding your answer.

Traffic: 1104 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6