Question

How to get SNP 6.0 array data in R?

0

Entering edit mode

8.8 years ago

agicict ▴ 200

Hi.

I currently downloaded a number of SNP datasets from GEO (CCLE)

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE36138

Also I downloaded Genomewide affymetrix 6.0 SNP array annotation data from affymetrix site.

http://www.affymetrix.com/support/technical/byproduct.affx?product=genomewidesnp_6

I imported the SNP 6.0 CEL files using affy package's ReadAffy.

Strangely, every CEL files has approximately 7,000,000 rows but affymetrix annotation data has around 900,000 rows.

What is right way to match probes?

R SNP annotation • 5.9k views

ADD COMMENT • link updated 2.3 years ago by Ram 44k • written 8.8 years ago by agicict ▴ 200

0

Entering edit mode

You could use the crlmm Bioconductor package

ADD REPLY • link 8.8 years ago by Jan Oosting ▴ 920

Ram · Accepted Answer · 2016-01-26

5

Entering edit mode

8.8 years ago

Henrik Bengtsson ▴ 80

The GenomeWideSNP_6 chip type has 6,892,960 probes (never changes). These probes are arranged in probe sets ("units") corresponding to 934,946 bi-allelic SNPs and 946,371 non-polymorphic single-probe CN loci. The exact number differ slightly between genome builds.

Check out the aroma.affymetrix R package, cf. http://aroma-project.org/. It has several ready pipelines for GenomeWideSNP_6, especially for copy-number analysis.

/Henrik
(author of aroma.affymetrix)

ADD COMMENT • link updated 4.9 years ago by Ram 44k • written 8.8 years ago by Henrik Bengtsson ▴ 80

0

Entering edit mode

Thank you for your kind explanation.

ADD REPLY • link 8.8 years ago by agicict ▴ 200