Different versions of the same type of Affymetrix Array in GEO?
2
1
Entering edit mode
9.4 years ago
ad ▴ 30

Hi, I'm looking through data on GEO from affymetrix arrays and using NetAffx to determine what probes I should search for based upon the platform they claim to use. (for example [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array or [HG-U133A] Affymetrix Human Genome U133A Array). What I'm finding though is that the data in the Soft files from GEO is often missing a large number of probe ids NetAffx says they should have especially of things like lncRNAs. Not only that but GEO datasets using the same platform are often missing probe ids the other has. Are there different versions of the same type of array, and would it be described in GEO somewhere I could check? Or is it just standard practice to truncate GEO datasets in some way?

microarray affymetrix • 3.6k views
ADD COMMENT
0
Entering edit mode
9.4 years ago
matt.newman ▴ 170

I'd consider just looking at the raw data (CEL files) when possible and going from there. I wouldn't necessarily trust the already normalized data being uploaded by the users, because its really to their discretion what they did to it. We've done a ton of curation work on our OncoLand (TCGA and more) (http://www.omicsoft.com/oncoland-service) and ImmunoLand (http://www.omicsoft.com/immunoland) which pull heavily from GEO and ArrayExpress, and we found it was best to just go back to the unnormalized Affymetrix data when possible

ADD COMMENT
0
Entering edit mode
9.4 years ago
Ahill ★ 2.0k

Are there different versions of the same type of array, and would it be described in GEO somewhere I could check?

Ad,

Yes, the designs you mention have different numbers of probes, and represent different generations of the HG-U133 family of designs. The "A" array was one half of a pair (A and B) and the _Plus_2 more or less combined the A and B designs into a single array. GEO Array platforms are described here: http://www.ncbi.nlm.nih.gov/geo/browse/?view=platforms

For each platform the full list of probe IDs is provided.

HG-U133_Plus_2 here: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL570 (54675 probes)

HG-U133A 2.0 here: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL571 (22277 probes)

ADD COMMENT

Login before adding your answer.

Traffic: 2660 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6