I would like to preprocess the microarry dataset GSE9006 which is Gene expression in PBMCs for children with diabetes. The array platforms are two, Affymetrix Human Genome HG-U133A and Affymetrix Human Genome HG-U133B. I need to combine two platform to increase the number of samples (n) and then analyse them for geting differential expression for each genes is it enough to download the data and normalize the expression matrix for each platform then merge them according to Gene Entrez? Thanks a lot for your cooperation
Have a look at this previous question,
what is difference between HG-U133A and HG-U133B array ? which one to use ?,
and at the Affymetrix documentation for the arrays.
The Affymetrix HG-U133A and HG-U133B arrays were a set, not different platforms, so you might want to check whether samples from each patient were run on both arrays.
See also this page:
https://www.ncbi.nlm.nih.gov/gds?term=GSE9006
@mastal511 I want to use both of them each platform contain (117 sample : 80T1D 24 Normal 12 T2D) I want to use all the T1D vs Normal samples which is (160 T1D vs 48 Normal). each platform has a defferent probes only 4478 common genes between them. and the expression level is deffrent as well.
I am new in the Bioinformatic I need a large number of samples to run my approach. How can i combain these set as you called them?
The 2 arrays form a set. The U133A array contained more well-known genes, and the U133B more probesets based on evidence from ESTs, and each array contained some 22K probesets.
Essentially there are 117 samples, presumably each run on both U133A and U133B, and in total, you have intensity measurements from some 44K or so probesets. So half the information from each sample is on the A array, and the other half is on the B array. This is not the same as trying to combine information from different experiments run on different technologies, like for example, Affymetrix expression arrays and Illumina expression arrays, which are designed in different ways.
Some genes will have several probesets assigned to them, either on the same array, or some on the A array and some on the B array, but the probesets will probably be looking at different parts of the gene (although in general most of the probes on those types of Affymetrix arrays targeted the 3' ends of the genes), or have been designed to target alternative transcripts produced from the same gene.
What you should do is look at which probesets are differentially expressed. The annotations (genes the probesets are assigned to) for the probesets may change from time to time, especially for probesets on the B array that were designed based on information from ESTs.
I am intersted in those 112 genes. As you see their FC values is differnet in each platform How can I combain U133A+U133B in one expression matrix
A similar question was just posted today: Regarding Microarray Platforms
Please take a look at my answer and see if it helps.
Thank you Dr kevin I will