Hi all, I don't know if this is asked before, and it's seemingly a bit naive, but I just need to get my head around it. I am currently trying to perform DE analysis on bulk RNA-seq data of a group of sorted cells. I have 4 different cell types which are at different stages of development which goes like this: A --> B --> C --> D.
I am trying to look into what genes are DE when they differentiate from 1 population to another. At first, I did a DE analysis using edgeR
comparing populations B, C and D against the initial population A. With this method, the number of DE genes for each population are as such (using cutoff of FDR < 0.01 and Log2FC < -2 or > 2):
B vs A: 920 Down, 501 Up
C vs A: 1198 Down, 560 Up
D vs A: 1549 Down, 1100 Up
Next, I then performed DE comparing the subsequent population against the prior population using the similar parameters and cutoffs, and my results are as such:
B vs A: 920 Down, 501 Up
C vs B: 0 Down, 5 Up
D vs C: 15 Down, 178 Up
My question is why there's such a stark difference of the number of DE genes. Logically speaking, assuming the bulk of DE genes of D vs A are similar to C vs A, then the overall DE genes of D vs C would be around 300 Down and 500 Up; but it is not so when DE was performed comparing D vs C. Why is this so? Is my assumption faulty?