Hi All,
I want to run a differential expression experiment with DEseq2, however I am a little confused regarding how to set up the formula. I have matched tumor/normal pairs and I want to contrast that to a third clinical variable (see table below).
Sample Tissue Clinical_Status
SampleA Tumor X
SampleA Normal X
SampleB Tumor Y
SampleB Normal Y
The biological question is: "Are there Differentially expressed genes in tumor samples according to clinical status?"
To answer this I could remove the normal samples all together in which case the design formula would be this: design = ~ Clinical_Status
However This does not seem optimal as I would have no idea if the genes would be DE in the normals as well and as such should be ignored. I could run the test again (for normals) and remove those DE genes from the original test however I believe there would be a multiple testing problem there.
I am thinking there is a way to test what I want all at once and design should be something like this: design = ~ Sample + Clinical_Status + Tissue:Clinical_Status
However I am not 100% sure, any help would be appreciated. apologies for the naivety of the question.
maybe i'm not understanding... what i'm looking for is slightly more complicated I think than ~ sample + clinical_status. I want to know only those genes differentially expressed based on clinical status in the tumor samples only (i.e. I want ~ sample + clinical status but i also want to remove anything that would have come out of that interaction if it was run on the normal samples only)
So there are 3 different things here, I have samples which have an associated tissue type (paired tumor/normals) and I have a clinical status, lets say response vs non response.