Question

combine affy HGU95Av2 and HGU133plus2

0

10.6 years ago

TriS ★ 4.8k

Hi all

I am working with data coming from two different platforms: HGU95Av2 and HGU133plus2. Ultimately I want to find differentially expressed genes.

I want to start from scratch with the CEL files, but I am not sure of the best way to normalize them.

So far I am:

1. Normalizing HGU95Av2 and HGU133plus2 separately using expresso

(bgcorrect.method="mas", normalize.method="quantiles", pmcorrect.method="mas", summary.method="medianpolish")

2. Matching the probes across platforms using biomaRt and keeping only those present on both

3. Combining the data.
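
Steps 2 and 3 above could look roughly like the following base R sketch. It is only an illustration under stated assumptions: the probe IDs, gene symbols, expression values and the helper `collapse_to_genes` are all made up, the mapping tables stand in for whatever a biomaRt query would return, and averaging probe sets per gene is just one of several ways to collapse them.

```r
# Toy expression matrices (rows = probe sets, columns = samples).
# All IDs, symbols and values below are made up for illustration.
set.seed(1)
expr95 <- matrix(rnorm(8, mean = 7), nrow = 4,
                 dimnames = list(c("p95_1", "p95_2", "p95_3", "p95_4"),
                                 c("s95_a", "s95_b")))
expr133 <- matrix(rnorm(12, mean = 8), nrow = 4,
                  dimnames = list(c("p133_1", "p133_2", "p133_3", "p133_4"),
                                  c("s133_a", "s133_b", "s133_c")))

# Hypothetical probe-to-gene tables, standing in for a biomaRt result.
map95 <- data.frame(probe = rownames(expr95),
                    symbol = c("GENE1", "GENE2", "GENE2", "GENE3"))
map133 <- data.frame(probe = rownames(expr133),
                     symbol = c("GENE2", "GENE3", "GENE4", "GENE3"))

# Collapse probe sets to one row per gene by averaging (an assumption;
# other choices, such as keeping the most variable probe set, are possible).
collapse_to_genes <- function(expr, map) {
  symbols <- map$symbol[match(rownames(expr), map$probe)]
  sums <- rowsum(expr, group = symbols)                # per-gene column sums
  counts <- as.vector(table(symbols)[rownames(sums)])  # probe sets per gene
  sums / counts                                        # per-gene column means
}

genes95 <- collapse_to_genes(expr95, map95)
genes133 <- collapse_to_genes(expr133, map133)

# Keep only genes measured on both platforms, then bind the samples together.
common <- intersect(rownames(genes95), rownames(genes133))
combined <- cbind(genes95[common, , drop = FALSE],
                  genes133[common, , drop = FALSE])

# Remember which platform each column came from.
platform <- rep(c("HGU95Av2", "HGU133plus2"),
                times = c(ncol(genes95), ncol(genes133)))
```

Keeping the `platform` vector alongside `combined` makes it easy to colour a boxplot by platform and check whether one group of samples sits lower than the other.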

The boxplot I get is not awful, but it is not as clean as I'd like: the HGU95Av2 samples on the left side sit at slightly lower intensity (link to boxplot here: https://drive.google.com/file/d/0BxzhXZ5eBptDMG03bWJXeE9sS2s/view?usp=sharing).

What would you guys suggest?

microarray normalization R • 2.6k views

updated 3.4 years ago by Ram 45k • written 10.6 years ago by TriS ★ 4.8k

Ram · Accepted Answer · 2015-01-20

1

10.6 years ago

Manvendra Singh ★ 2.2k

I think your plot is displaying intensities on a log scale, so you could first remove the probes with low intensities and then quantile normalize the combined data.

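library(limma)
# data: numeric matrix of log-scale intensities (probes in rows, samples in columns)
data.norm <- normalizeQuantiles(data)

Your data should then be normalized the way you want.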
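
To make it clearer what quantile normalization does to the combined matrix, here is a minimal base R sketch. It is not limma's implementation (limma also handles ties and missing values); the function name `quantile_normalize` and the toy data are made up for illustration, and the sketch assumes a numeric matrix with no NAs.

```r
# Illustrative quantile normalization: force every column (sample) to share
# the same distribution, namely the mean of the sorted columns.
quantile_normalize <- function(x) {
  ranks <- apply(x, 2, rank, ties.method = "first")  # rank of each value within its column
  sorted <- apply(x, 2, sort)                        # each column sorted ascending
  target <- rowMeans(sorted)                         # reference distribution
  out <- apply(ranks, 2, function(r) target[r])      # map ranks back to reference values
  dimnames(out) <- dimnames(x)
  out
}

# Toy data: three samples centred near 7 and two centred near 8,
# mimicking a shift between two platforms.
set.seed(42)
toy <- cbind(matrix(rnorm(30, mean = 7), ncol = 3),
             matrix(rnorm(20, mean = 8), ncol = 2))
normalized <- quantile_normalize(toy)

# After normalization every column has the same set of values,
# so boxplot(normalized) would show identical boxes.
```

Note that this makes the sample distributions identical by construction, so it removes the visible shift in the boxplot whether that shift is technical or biological.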

updated 3.4 years ago by Ram 45k • written 10.6 years ago by Manvendra Singh ★ 2.2k

0

I forgot that limma has a bunch of normalization options. I ended up using normalizeBetweenArrays(x, method = "quantile").

Thanks :)

updated 3.4 years ago by Ram 45k • written 10.6 years ago by TriS ★ 4.8k

0

Yes, quantile normalization is best when you intersect datasets from different platforms or experiments.

updated 3.4 years ago by Ram 45k • written 10.6 years ago by Manvendra Singh ★ 2.2k

0

However, the limma manual says that the input should be non-normalized data, so would it be correct to use only bg.correct() and then pass those values to limma for the normalization?
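
One way to read this idea is sketched below: run expresso with its normalize argument switched off, so that each platform is only background corrected and summarized, and leave the normalization to a later step on the combined matrix. This is a sketch, not a recommendation from the thread; the function name `summarize_without_normalizing` and its `cel_files` argument are hypothetical, and the code needs the Bioconductor packages affy and Biobase plus real CEL files, so the function is defined but not called here.

```r
# Sketch only: requires the affy and Biobase packages and real CEL files.
# cel_files is a hypothetical character vector of CEL file paths
# from a single platform.
summarize_without_normalizing <- function(cel_files) {
  batch <- affy::ReadAffy(filenames = cel_files)
  eset <- affy::expresso(batch,
                         bgcorrect.method = "mas",
                         normalize = FALSE,        # skip per-platform normalization
                         pmcorrect.method = "mas",
                         summary.method = "medianpolish")
  Biobase::exprs(eset)  # probe sets x samples expression matrix
}
```

The function would be called once per platform; the two resulting matrices could then be matched, combined, and passed to normalizeBetweenArrays(x, method = "quantile") as described earlier in the thread.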

updated 3.4 years ago by Ram 45k • written 10.6 years ago by TriS ★ 4.8k