I have a dataset where I want to see effect of a drug on my patients who responded and not responded towards treatment. I collected their blood at three different time point or visit. For each patient I have their age and sex information with me. Now to perform differential expression analysis I used DESeq2 to perform time series analysis as I have collected blood at three different visit. I want to control age and gender effect on my data so I can see interaction between responder group and different time point. Here is the sample table and my DESeq2 design formula:
sample Phenotype visit Age Gender
1 NonResponder 1 42 female
2 NonResponder 2 42 female
3 NonResponder 3 42 female
4 NonResponder 1 49 female
5 NonResponder 2 49 female
6 NonResponder 3 49 female
7 NonResponder 1 27 male
8 NonResponder 2 27 male
9 NonResponder 3 27 male
10 Responder 1 77 female
11 Responder 2 77 female
12 Responder 3 77 female
13 Responder 1 51 male
14 Responder 2 51 male
15 Responder 3 51 male
16 Responder 1 47 male
17 Responder 2 47 male
18 Responder 3 47 male
So which design should I use to control age and gender effect on my data
**design 1:
dds=(design= ~age+gender+visit+phenotype+visit:phenotype+age:phenotype+gender:phenotype)
dds=DESeq(dds)****
**design 2:
dds=(design=~age+gender+visit+phenotype+visit:phenotype)
dds=DESeq(dds,test="LRT", reduced=~age+gender)****
I will highly appreciate help with this
Best,
Lalit
Dear Swbarnes2, Thank you so much for the reply. Actually its just a small data set which i presented in this forum. I have 10 non responder patient and 9 responder patient. I collected their blood at three different time point or visit. So in total I have total 57 samples for three visit. I did PCA analysis using phenotype age and visit information but I did not see separate cluster. Variation between PCA1 and PCA2 was not that much. PCA1 18% and PCA2 was 9%. But I am not sure which design I should use to see genes where visit and phenotype have effect and they are not affected by age and gender. I want to correct this data for age and gender. Should I use design 1 as full model dds=(design= ~age+gender+visit+phenotype+visit:phenotype+age:phenotype+gender:phenotype) dds=DESeq(dds)
or should I use design 2 as reduced model to correct my data for age and gender dds=(design=~age+gender+visit+phenotype+visit:phenotype) dds=DESeq(dds,test="LRT", reduced=~age+gender)
Best Regards, Lalit