I have 4 factors in the design formula in DESeq2
dds <- DESeqDataSetFromMatrix(countData = dfinput0,
colData = sampleTable,
design= ~ condition+age+sex+type,condition:type )
I got the error:
Error in checkFullRank(modelMatrix) :
the model matrix is not full rank, so the model cannot be fit as specified.
One or more variables or interaction terms in the design formula are linear
combinations of the others and must be removed.
This is my coldata:
condition type age sex
1 control z 57 Female
2 control af 72 Male
3 control ac 22 Male
4 control ab 71 Female
5 control ab 71 Female
6 control at 68 Male
7 control ba 81 Male
8 control ba 81 Male
9 control bh 68 Male
10 control bh 68 Male
11 control am 81 Male
12 control am 81 Male
13 control an 79 Male
14 control ao 83 Female
15 control ap 71 Male
16 control ap 71 Male
17 control aq 77 Male
18 control aq 77 Male
19 control ar 59 Male
20 control as 67 Female
21 control au 80 Female
22 control au 80 Female
23 control av 77 Female
100 treated e 48 Male
101 treated e 48 Male
102 treated e 48 Male
103 treated b 32 Female
104 treated b 32 Female
105 treated f 45 Female
106 treated f 45 Female
107 treated a 68 Male
108 treated a 68 Male
109 treated j 64 Male
110 treated j 64 Male
111 treated i 69 Male
112 treated i 69 Male
113 treated h 45 Female
114 treated c 59 Male
115 treated c 59 Male
116 treated g 75 Female
117 treated g 75 Female
the type column which represent biological replicates within the condition cause this error. Each sample is coming from human samples and for some human samples there are 2 or 3 sample, for others there is only one sample per human. How can I include the biological replicate information without having this error to DESEq2 analysis?
You'll want to use the
collapseReplicates()
function in DESeq2, see https://rdrr.io/bioc/DESeq2/man/collapseReplicates.html.That function is only intended for technical replicates, which is not the case here.