Hello All,
Curious as to how to best represent three biological replicates of a single tissue sample of RNASeq data? Assuming that these each replicates are normalized (RPKM, TPM), is there any other statistical analysis that needs to be taken into account before deciding on taking average (mean), median or other approaches?
Regards,
B
Thank you Asaf for your suggestions. Still not clear on if it's allowed to take an average (mean) of three biological replicates to represent single sample?
For example,
In the above case, the average(mean) for all three gene is 3. However, I assume that Gene1 is not expressed since only one replicate has read counts which may be due to technical error,
If anything you should sum the counts. You can do whatever you like but agglomerating the replicates depict the purpose of having biological replicates which is to estimate the variance. Every statistical test relies on the variance (in one way or another), it's usually estimated so as you have more replicates the estimation is better.
Asaf Thank you again. So for three biological replicates what statistical test do you think suits better?
If you have the count matrix I would use DESeq2 or edgeR