Sample Correlation in RNA-Seq data
1
0
Entering edit mode
9.3 years ago
kaihami • 0

Hello,

I think it might be a very silly question, therefore sorry about my ignorance.

Imagine a RNA-seq data set with n samples, and I want to determine the correlation between each other.

After count normalization, we can perform a correlation between each sample, using Pearson, Spearman, or other one. In a gene to gene correlation I do can understand how these tests works.

But if I have a huge data set (n samples) with m genes, how the correlation test works? Can anybody answer me please?

Regards,

Correlation RNA-Seq • 7.2k views
ADD COMMENT
1
Entering edit mode
9.3 years ago
ethan.kaufman ▴ 380

Correlation is a pairwise measure. You can calculate correlation between two samples (by considering each gene as an independent observation) but not between n samples. To get a sense of the overall concordance of your dataset, I would calculate all pairwise correlations, which would generate a symmetric nxn correlation matrix, and should identify any outlier samples. The corrplot function in R provides a nice heatmap-style visualization of this.

ADD COMMENT
0
Entering edit mode

Yep, a really silly question, I thank you ethan. I don't know why I haven't seen it before lol

ADD REPLY

Login before adding your answer.

Traffic: 1954 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6