Obtaining gene names for .CEL files
1
0
Entering edit mode
10.3 years ago
maria.kesa ▴ 30

Hello everybody,

My question is: how do I get a gene list using Bioconductor from a .CEL file for the HG_U95Av2 array(from the arrays at http://www.broadinstitute.org/cgi-bin/cancer/publications/pub_paper.cgi?mode=view&paper_id=75)?

I tried following the answer in this post, but I couldn't do it. I can't figure out how to use the annotation library I found in Bioconductor look for gahgu95av2)

I already figured out how to do normalization of the files using justRMA, now I need the gene annotations for the numbers.

Your hints and and answers would be really helpful:-) Thank you!

Cheers,
Maria

R gene-expression • 4.3k views
ADD COMMENT
0
Entering edit mode

Can you show how the gene ids look like?

ADD REPLY
0
Entering edit mode

1000_at, 1001_at etc:-)

ADD REPLY
5
Entering edit mode
10.3 years ago
komal.rathi ★ 4.1k
source("http://bioconductor.org/biocLite.R")

biocLite("hgu95av2.db")

library(hgu95av2.db)

Annot = data.frame(SYMBOL=sapply(contents(hgu95av2SYMBOL), paste, collapse=","), 
           DESC=sapply(contents(hgu95av2GENENAME), paste, collapse=","),
           ENTREZID=sapply(contents(hgu95av2ENTREZID), paste, collapse=","),
           ENSEMBLID=sapply(contents(hgu95av2ENSEMBL), paste, collapse=","))

Annot$SYMBOL will contain the gene names for your IDs.

ADD COMMENT
0
Entering edit mode

Thank you, that's perfect!!!

ADD REPLY
0
Entering edit mode

Thank you so much, Komal:-) I never had a bioinformatics course or worked with Bioconductor, so I was really confused:-) Thank you for your time!

ADD REPLY

Login before adding your answer.

Traffic: 1864 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6