I am using limma to look at a GSE microarray (GSE10918 to be specific). I am trying to figure out which design matrix to use before fitting the linear model and am having a hard time figuring it out. It seems from the phenoData, the GSM's are in this format (so they are already subtracted and log2 transformed):
gene1 - control
gene1 - control
gene1 - control
gene2 - control
gene2 - control
gene2 - control
What I want is to find the effects of gene1 and gene2 separately. I am wondering what the best design matrix is for this.
Would do please explain it in more details? are you going to transform the raw dataset of GSE for R package limma?
As descripted in the raw dataset of GSE10918 GSE10918_series_matrix.txt (ftp://ftp.ncbi.nih.gov/pub/geo/DATA/SeriesMatrix/GSE10918/), I think the value is log2(cy5/cy2).
You can find some background infromation about microarray analysis here (http://soybeangenomics.cropsci.illinois.edu/files/NSF_Web_Microarrayresults.pdf).