Hello all,
I'm trying to combine two datasets for analysis and have some questions about setting up the design matrix. I need to account for differences between two batches, two conditions, and technical replicates. The targets list looks like this:
Sample Condition Batch Individual
1 1 1 1
2 1 1 2
3 1 1 3
4 1 1 4
5 1 1 5
6 1 1 6
7 1 1 7
8 1 1 8
9 1 2 1
10 1 2 1
11 1 2 2
12 1 2 2
13 2 2 9
14 2 2 9
15 2 2 10
16 2 2 10
17 2 2 11
18 2 2 11
As you can see, there are technical replicates both within batches (every individual in batch 2 has replicates) and between batches (individuals 1 and 2 appear in both batches).
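I can't create a usable model matrix from these factors as they stand, presumably because the columns are not linearly independent: each individual belongs to exactly one condition, so Condition is completely determined by Individual. Is there any way to build a consistent model matrix that accounts for all of these variables, given that not all individuals have replicates?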
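To make the problem concrete, here is a minimal sketch of the check I have in mind. It is written in Python with NumPy purely for illustration and is not the software I'm actually running. The treatment-style dummy coding (intercept plus one indicator column per non-reference level) and the helper name `dummies` are my own assumptions. It builds the design from the targets above and compares the number of columns with the matrix rank.

```python
import numpy as np

# (condition, batch, individual) for samples 1..18, copied from the targets list.
targets = [
    (1, 1, 1), (1, 1, 2), (1, 1, 3), (1, 1, 4),
    (1, 1, 5), (1, 1, 6), (1, 1, 7), (1, 1, 8),
    (1, 2, 1), (1, 2, 1), (1, 2, 2), (1, 2, 2),
    (2, 2, 9), (2, 2, 9), (2, 2, 10), (2, 2, 10),
    (2, 2, 11), (2, 2, 11),
]
condition, batch, individual = zip(*targets)


def dummies(values):
    """Hypothetical helper: indicator columns for every level except the first."""
    levels = sorted(set(values))
    return np.array([[1.0 if v == lev else 0.0 for lev in levels[1:]]
                     for v in values])


# Intercept + Condition + Batch + Individual, all as main effects.
design = np.column_stack([
    np.ones(len(targets)),
    dummies(condition),
    dummies(batch),
    dummies(individual),
])

rank = np.linalg.matrix_rank(design)
print("shape:", design.shape)  # (18, 13)
print("rank: ", rank)          # 12, one less than the number of columns

# Each individual appears under exactly one condition, so the Condition
# column is a sum of Individual columns and adds nothing to the rank.
conditions_per_individual = {
    ind: {c for c, _, i in targets if i == ind} for ind in set(individual)
}
nested = all(len(c) == 1 for c in conditions_per_individual.values())
print("Individual nested in Condition:", nested)  # True
```

Under these assumptions the matrix has 13 columns but rank 12, so one coefficient cannot be estimated when Condition and Individual are both fitted as fixed effects.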
Thanks for your help,
-Adam