Hi,
In DESeq, If I have four samples - A, B, C, D - and only wish to compare A with B and C with D to test for differential expression, should I include all the samples in the analysis?
By this I mean, when performing the normalisation steps, will including the other 2 samples change how the data is normalised (given that the library sizes may be different for example)?
Thanks,
Jom.
So you had 4 different treatments for basically the same samples?
Sorry, each sample is taken from a different part of the plant. A = leaves (from leaf morphology mutant plant), B = flowers (from mutant), C = flowers (wildtype), D = leaves (wildtype), . My aim is to test for genes in the leaf (mutant) sample which are upregulated over leaf (wt) and genes in the flower (mutant) sample upregulated over flower (wt). Ultimately, genes upregulated in the mutant (leaf and flower) over wildtype (leaf and flower) are the genes I'm interested in.
From that, it sounds like you're best off loading all of the samples and just using a GLM, which DESeq can handle. BTW, I hope you have replicates from these.
Question: Can I use contrast to compare say, just A to B once I load all of the samples into a R package?