How to best represent three biological replicates of single tissue sample?
1
0
Entering edit mode
16 months ago
BioinfoBee • 0

Hello All,

Curious as to how to best represent three biological replicates of a single tissue sample of RNASeq data? Assuming that these each replicates are normalized (RPKM, TPM), is there any other statistical analysis that needs to be taken into account before deciding on taking average (mean), median or other approaches?

Regards,
B

biological-replicates RNA-Seq • 1.3k views
ADD COMMENT
1
Entering edit mode
16 months ago
Asaf 10k

What you have is called clustered data. There are methods to deal with the potential bias it has on estimating the variation like using mixed models and having the tissue a random effect on the intercept. Tools like DESeq2 or edgeR don't include random effects, however, and the clusters can be incorporated as fixed effects using edgeR trick for nested effects.

And applying the statistical test on the raw counts is preferable (with the right model).

ADD COMMENT
0
Entering edit mode

Thank you Asaf for your suggestions. Still not clear on if it's allowed to take an average (mean) of three biological replicates to represent single sample?

For example,

        S1.1     S1.2       S1.3
Gene1   0        0          9
Gene2   3        3          3
Gene3   2        6          1

In the above case, the average(mean) for all three gene is 3. However, I assume that Gene1 is not expressed since only one replicate has read counts which may be due to technical error,

ADD REPLY
1
Entering edit mode

If anything you should sum the counts. You can do whatever you like but agglomerating the replicates depict the purpose of having biological replicates which is to estimate the variance. Every statistical test relies on the variance (in one way or another), it's usually estimated so as you have more replicates the estimation is better.

ADD REPLY
0
Entering edit mode

Asaf Thank you again. So for three biological replicates what statistical test do you think suits better?

ADD REPLY
1
Entering edit mode

If you have the count matrix I would use DESeq2 or edgeR

ADD REPLY

Login before adding your answer.

Traffic: 2420 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6