Question

How to make cluster from gene read count?

0

Entering edit mode

6.1 years ago

ahmad mousavi ▴ 800

Hi

I have several questions and I appreciate for answer :

1- I wanted to know how can I add cluster analysis like below image based on read count. 2- Should I run the code only for the genes with minimum number of reads (e.g rowSum(gene) > 10) or I should run it only for DEGs ?

Tnx.

enter image description here

RNA-Seq clustering R deseq2 edgeR • 2.5k views

ADD COMMENT • link updated 6.1 years ago by Chirag Parsania ★ 2.0k • written 6.1 years ago by ahmad mousavi ▴ 800

score 0 · Answer 1 · 2018-10-10

0

Entering edit mode

6.1 years ago

pixie@bioinfo ★ 1.5k

Try out CLUST https://baselabujamous.com/computational-methods/clust/ Read counts, FPKM, TPM etc are accepted in that.

Regarding genes, you might want to leave out genes which show zero or low variance across samples over and above the minimum cutoff for genes.

ADD COMMENT • link 6.1 years ago by pixie@bioinfo ★ 1.5k

0

Entering edit mode

Thanks,

Do you have any idea of creating such plot with R?

ADD REPLY • link 6.1 years ago by ahmad mousavi ▴ 800

1

Entering edit mode

you need to calculate the z-scores ((x-mean)/sigma) for the counts. From this, follow any tutorial on plotting in R. You should be able to get the above plots. With CLUST, you can get similar plots. https://github.com/BaselAbujamous/clust

ADD REPLY • link 6.1 years ago by pixie@bioinfo ★ 1.5k

score 0 · Answer 2 · 2018-10-15

Hi, First you can find the top genes which are highly variable across the conditions. One way to do this is calculate the standard deviation (SD) across the sample for each gene and then arrange from high to low using SD values. You can choose top 10% or 20% of the genes and later parform k-means on them could give you nice patterns across the samples. The plot I have shown in the another post which is exactly done by this approach.