Is it possible to perform differential gene expression analysis on data with such dispersion, BCV and MDSplot? (Fig. A, B)
y_disp_design <- estimateDisp(y_filtered, design = design)
y_disp_design$common.dispersion
0.3251901
Is it possible to perform differential gene expression analysis on data with such experimental design?
I work with the data of a non-model invertebrate in which an particular organ (syncytial structure) develops in its tissues.
- A - "normal" body tissues before organ development
- B,C,D - 3 consecutive development stages of this organ
Each sample B,C,D contains contamination by "normal" tissues (sample A) as it was impossible to separate them.
It was assumed that the proportion of "normal" tissues (sample A) would be approximately the same in all samples, but, as I understand from the location of A samples on the MDS plot and the high values of BCV, this was not achieved.
The aim of the study was to identify some of signaling pathways involved in development of the organ of interest.
- Are there any ways to analyze such data? Or the problems described above make any statistical analysis impossible?
My pipeline: Trimming -> Trinity -> CD-HIT -> TransRate -> Salmon -> tximport -> EdgeR
Thank you very much for your answer! I didn't even know about this approach. I will try it.