Hi,
I am doing differential gene expression by Deseq2 tool. I have no replicates and I am doing multi group comparison between samples.
Can anyone tell me how to run the Deseq2 tool without replicates?
Hi,
I am doing differential gene expression by Deseq2 tool. I have no replicates and I am doing multi group comparison between samples.
Can anyone tell me how to run the Deseq2 tool without replicates?
None of your groups has replicates ? If not it will be very difficult (even impossible) to have reliable results ..
Nevertheless in the DESeq2 vignette : https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#can-i-use-deseq2-to-analyze-a-dataset-without-replicates
As stated in DESeq2 manual :
This kind of analysis is only useful for exploring the data, but will not provide the kind of proper statistical inference on differences between groups. Without biological replicates, it is not possible to estimate the biological variability of each gene. More details can be found in the manual page for ?DESeq.
I'm afraid the only valid answer here will be NOT .
if you insist have a look here: http://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html (first hit on google btw)
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Some links to previous topics towards that:
DESeq2 (or EdgeR) Exploratory Analysis with no Replicates
https://support.bioconductor.org/p/101210/
https://support.bioconductor.org/p/77623/
http://seqanswers.com/forums/showthread.php?t=31036
Deseq Analysis With Two Samples Without Replicates, Most Padj Equal To 1 And Na
Yes i have no replicates in any group and i nothing found answer to my question in manual of Deseq
From the manual (Nicolas Rosewick even linked it):
The answers to your question are availablevia a solid search in the web, this literally has been discussed like 100 times before. Please show some effort and look for these threads.
Its better to use EdgeR for data without replicates. Please see the section 2.11 (What to do if you have no replicates)
No, it is not. If you read this section you refer to you will see that
Any framework, be it
edgeR
orDESeq2
that relies on dispersion estimation between replicates does not give reasonable results in the absence of replication.It is very simple: If you want reliable statistics, do replicates. If you have none (be it due to money, material availability/feasibility or poor experimental design) the best you can do is to select some candidate genes based on ranking FCs (that have reasonable counts to avoid high FCs based on low counts) and validate them using independent experiments. You are never going to make strong claims on unreplicated data. ...and yes, there are even highly-cited publications that make strong claims on unreplicated data, and one should wonder how many of these claims really held true if one repeated things using proper design and statistics.
Hi waqaskhokhar999 ,
thanks for your input (which is indeed "correct" and I assume the other people answering here are aware of that).
However, the point we want to make is to "educate" people that they should not do DEG analyses without replicates. This point is as well mentioned in the EdgeR section you direct to. It anyway remains a 'hack' approach.
The issue I have is that people like to pool samples together when uploading them. For example: https://www.ebi.ac.uk/ena/browser/view/SRR10857342 this contains rna seq data from several timepoints pooled together. I have no idea how to separate them or how to run deseq2 without 'replicates'.
Please open a new question, try to add as much detail as you can.