RNA seq normalization and gene gene correlation
0
0
Entering edit mode
7 months ago
Fish • 0

Hello, I'm new to RNA seq data and would like to seek your help in the following.

My goal is to analyze gene-gene co-expression using Pearson correlation between two sample groups (normal vs disease). For example, I want to see if there is a higher correlation between Gene A and Gene B in normal vs disease.

Now my question is:

  1. Which normalization method should I use to reach this goal? I see some forums mentioned VST but some also discourage this. What is your take, and why?
  2. If VST works, should I remove genes with low counts before or after VST normalization?

Thank you for your answers in advance!

rna-seq pearson-correlation vst • 388 views
ADD COMMENT
1
Entering edit mode

I see some forums mentioned VST but some also discourage this.

Please provide links for these iscussions. I bet they are in the context of "using VST in the context of differential testing". Here one should use the raw counts when using DESeq2 or edgeR.

The idea behind VST is to mitigate the effects of variance, basically VST enables comparison of genes that demonstrate different levels of variance (see the quote below from the DESeq2 tutorial). I would use VST for your problem. Having said that, why reinventing the wheel and not using one of the existing methods/packages?

The point of these two transformations, the VST and the rlog, is to remove the dependence of the variance on the mean, particularly the high variance of the logarithm of count data when the mean is low.

ADD REPLY

Login before adding your answer.

Traffic: 1619 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6