How To Understand The Description Of Illumina Human610-Quad V1.0 Beadchip On Geo?
2
0
Entering edit mode
13.0 years ago
tflutre ▴ 580

I am interested in a genotype dataset on GEO that was obtained using the Illumina Human610-Quad v1.0 BeadChip (GPL8887). Thanks to the R package GEOquery, I can easily retrieve the data in R:

library(GEOquery)
gpl <- getGEO('GPL8887', destdir='.')
head(Table(gpl), n=2)
      ID Chr   MapInfo SNP_ID
1 200003   9 139026180 200003
2 200006   9 139046223 200006

But what does this mean?

  • Does the column SNP_ID correspond to the rs identifier in dbSNP? Apparently no, as rs200003 is not on chr9.
  • Does the column MapInfo correspond to the SNP coordinate? If yes, on which version of the human genome reference sequence is it mapped? hg18 or hg19?

I search on the Internet (on Illumina's web site, etc) but couldn't find any information. Any idea?

geo illumina genotyping • 6.9k views
ADD COMMENT
0
Entering edit mode

For info, this platform has 598,821 markers with a "rs" identifier: length(grep("rs", Table(gpl.geno)$SNP_ID))

ADD REPLY
0
Entering edit mode
13.0 years ago
Neilfws 49k
  1. Some of the values in SNP_ID do correspond to dbSNP, others do not. If you examine some of the later rows in the table, you'll see all kinds of identifiers used for probes with prefixes such as: rs, cnvi, and GA.
  2. Yes, MapInfo is the chromosomal coordinate and since this is an older platform (2009), hg18 is the genome version.

You might find it useful to download the PennCNV package and look at the file lib/hhall.hg18.pfb - it contains probe IDs, chromosomes and coordinates which should match most of those from your platform.

ADD COMMENT
0
Entering edit mode

@neilfws Thanks, I accepted your answer. But I wanted to mention that for my purpose the file "hhall.hg18.pfb" in PennCNV only contains information redundant with the data from GEO.

ADD REPLY
0
Entering edit mode

True. I only mention the file since it's derived from the Illumina platform metadata, to illustrate that the probe IDs are in fact "official" Illumina IDs.

ADD REPLY
0
Entering edit mode
13.0 years ago
Alexbmp • 0

I've looked up many sites of GPL8887 and it seems to be hg18 coordinates.

However, what I'm wondering now is, how can one be sure that GPL8887 is hg18; since I cannot find the documentation.

Does anybody know where to find the documentation about GPL8887 genome version information?

ADD COMMENT

Login before adding your answer.

Traffic: 1923 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6