Question

Select a list of genes of interest to visualise in heatmap of DESeq object

0

Entering edit mode

4.6 years ago

jason.williams • 0

Hi,

I have previously generated heatmaps of the top x genes in DESeq2 using the following code:

topVarGenes <- head(order(rowVars(assay(rld)), decreasing = TRUE), 20)
mat  <- assay(rld)[ topVarGenes, ]
mat  <- mat - rowMeans(mat)
anno <- as.data.frame(colData(rld)[c("Condition")])
pheatmap(mat, annotation_col = anno)

This is in the main DESeq2 vignette and a heat map is generated with conditions and patients metadata etc.

As this just selects the top x genes, I was wondering if I can also select a list of genes of interest by ensembl ID, so that I can generate a heat map to show how these genes of interest change between conditions. This would also be useful to see how a list of genes associated with an ontology change between conditions.

Best wishes,

Jason

DESeq2 Heatmap • 5.2k views

ADD COMMENT • link updated 2.6 years ago by rpolicastro 13k • written 4.6 years ago by jason.williams • 0

score 1 · Answer 1 · 2020-07-20

1

Entering edit mode

4.6 years ago

rpolicastro 13k

If ensembl IDs were used to generate the count matrix, you can simply replace topVarGenes with a vector that contains any genes you wish to test.

genes <- c("gene_1", "gene_2", ...)
mat <- assay(rld)[genes, ]
mat <- mat - rowMeans(mat)
anno <- as.data.frame(colData(rld)[c("Condition")])
pheatmap(mat, annotation_col = anno)

ADD COMMENT • link 4.6 years ago by rpolicastro 13k

0

Entering edit mode

mat <- mat - rowMeans(mat) what is the function of this line? @ rpolicastro

ADD REPLY • link 4.6 years ago by cpad0112 21k

0

Entering edit mode

This was a line in their original code that I carried over. For each gene, the expression value for that gene in each sample is subtracted by the mean expression value for that gene over all samples.It's a way to center the data for display in a heatmap.

ADD REPLY • link 4.6 years ago by rpolicastro 13k

0

Entering edit mode

pheatmap has in-built row-wise or column-wise scaling option. One can use that function assuming that user wants to center the data row wise. On the other hand, i think z-scaling is more useful than subtracting means from the values (subtract and divide by standard deviation).

ADD REPLY • link 4.6 years ago by cpad0112 21k

0

Entering edit mode

This worked really nicely, thanks very much @rpolicastro, I think this will be useful to others that want to select a custom set of genes of interest in an RNA-Seq experiment.

ADD REPLY • link 4.6 years ago by jason.williams • 0

0

Entering edit mode

Hi! I'm trying to make a pheatmap with a group of 600 genes. When I make the list like you did, I get an error because of the lenght of the code. I tried dividing the list into 5 vectors and then using c() to make one, but I get the error "subscript out of bounds". When I import the list from an excel file and use it as a data frame I get this error: invalid subscript type 'list'. I have tried to change this list to different types but I don't understand what's wrong with it. Do you have any idea of what could be happening? Thanks!!

ADD REPLY • link 3.3 years ago by abrilluchetti • 0

0

Entering edit mode

What is assay here? It gives error "could not find function "assay"

ADD REPLY • link 2.6 years ago by Pankoj ▴ 10

0

Entering edit mode

assay gets the count matrix from SummarizedExperiment or related objects like the DESeq2 object.

ADD REPLY • link 2.6 years ago by rpolicastro 13k