Question

Affymetrix probeset IDs...


10.4 years ago

a11msp ▴ 120

Hi everyone,

I'm trying to look at the processed data for the following ArrayExpress entry:

https://www.ebi.ac.uk/arrayexpress/experiments/E-GEOD-57818/

The array design for this experiment is specified as "[MoGene-2_0-st] Affymetrix Mouse Gene 1.0 ST Array [transcript (gene) version]". (Isn't that self-contradictory: MoGene-2_0 versus Mouse Gene 1.0?)

The problem is that the IDs in the processed sample tables are not recognizable, either by me or by DAVID's ID conversion tool:

ID_REF    VALUE
17364176    6.11692
17306532    4.83607
17450319    6.78837
17260236    4.67415
17513311    7.34779
17217651    6.41337
17380366    10.9677
17548117    9.29091
17225413    6.68588
17458514    10.9731

Does anyone know what these IDs correspond to, or whether there is a problem with these files?

Many thanks!

Affymetrix Microarray • 8.7k views

ADD COMMENT • link updated 3.1 years ago by Ram 44k • written 10.4 years ago by a11msp ▴ 120


Hello,

What is the solution if I have human data?

Thanks

ADD REPLY • link 9.5 years ago by benhrif.oussama • 0


Replace "mogene20sttranscriptcluster" with "hugene20sttranscriptcluster" everywhere in the code, including in the library() call.

ADD REPLY • link 8.5 years ago by srinivasnvnk55 • 0


It depends on your chip. In the question above, the Affymetrix MoGene-2_0-st array was used. You can check which AnnotationDbi package corresponds to your chip here.

ADD REPLY • link 8.4 years ago by komal.rathi ★ 4.1k

Ram · Accepted Answer · 2014-06-24

5


10.4 years ago

komal.rathi ★ 4.1k

These are transcript cluster IDs from the Affymetrix MoGene-2_0-st annotation data. You can get the corresponding Entrez ID, Gene Symbol and Ensembl ID using:

# Install the annotation package (BiocManager replaces the retired biocLite script)
if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")
BiocManager::install("mogene20sttranscriptcluster.db")
library(mogene20sttranscriptcluster.db)

# One row per transcript cluster ID (row names); multiple mappings are comma-separated
Annot <- data.frame(SYMBOL=sapply(contents(mogene20sttranscriptclusterSYMBOL), paste, collapse=","),
                    DESC=sapply(contents(mogene20sttranscriptclusterGENENAME), paste, collapse=","),
                    ENTREZID=sapply(contents(mogene20sttranscriptclusterENTREZID), paste, collapse=","),
                    ENSEMBLID=sapply(contents(mogene20sttranscriptclusterENSEMBL), paste, collapse=","))

Then you can use the Gene Symbol, Entrez ID or Ensembl ID (recommended) to do the DAVID analysis.

ADD COMMENT • link updated 3.1 years ago by Ram 44k • written 10.4 years ago by komal.rathi ★ 4.1k


Thanks a lot, will try this out tomorrow!

ADD REPLY • link 10.4 years ago by a11msp ▴ 120


Perfect, it worked, thanks so much!

ADD REPLY • link 10.4 years ago by a11msp ▴ 120


Hi, I'm an R beginner. Can you please show me what I should do after running those commands?

ADD REPLY • link 7.6 years ago by waiwaihuhuyiyi • 0
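
One possible next step, once Annot exists, is to attach the annotations to the processed values by matching on ID_REF. The sketch below is only an illustration in base R. The expression values are the first three rows of the table in the question, but the small Annot data frame is a stand-in with hypothetical placeholder symbols ("GeneA", "GeneB"), not real annotations. With the real Annot from the accepted answer, the stand-in block would simply be skipped.

```r
# Processed values copied from the sample table in the question
expr <- data.frame(
  ID_REF = c("17364176", "17306532", "17450319"),
  VALUE  = c(6.11692, 4.83607, 6.78837),
  stringsAsFactors = FALSE
)

# Stand-in for the Annot data frame built in the accepted answer.
# ASSUMPTION: the symbols below are hypothetical placeholders.
# "NA" (a string) mimics what paste() produces for an unmapped ID.
Annot <- data.frame(
  SYMBOL    = c("GeneA", "GeneB", "NA"),
  row.names = c("17364176", "17306532", "17450319"),
  stringsAsFactors = FALSE
)

# Annot keeps the transcript cluster IDs as row names; expose them as a column
Annot$ID_REF <- rownames(Annot)

# Left join: keep every expression row, annotated or not
annotated <- merge(expr, Annot, by = "ID_REF", all.x = TRUE)

# Turn the string "NA" into a real missing value
annotated$SYMBOL[annotated$SYMBOL == "NA"] <- NA

# Unique, non-missing symbols, e.g. to paste into DAVID
gene_list <- unique(na.omit(annotated$SYMBOL))
print(annotated)
```

Note that merge() sorts the result by ID_REF, so the row order can differ from the input table. If each sample file is read with read.delim(), it is safer to read ID_REF as character (for example with colClasses) so that the IDs match the row names of Annot exactly.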