Question

Map gene IDs to gene names

0

Entering edit mode

7.9 years ago

bisansamara ▴ 20

Hi, I have a huge list of gene names, and I want to assign the ID for each gene. I tried the following code, and it kind of worked, but the problem was that it created extra IDs (more than the available genes) --> causing misalignment between names and IDs

CODE

library("org.Hs.eg.db")

a <- read.csv("filename.csv",TRUE,",")

y=as.character(a$Gene.refGene) #column name is Gene.refGene

gene=y

output = unlist(mget(x=gene,envir=org.Hs.egALIAS2EG,ifnotfound=NA))

write.csv(output, file = "output.csv") #write the IDs which is the output in a csv file

Is there any other way to get the gene IDs? or any suggestion on how to modify the code? your help is highly appreciated!

gene R Ensembl GeneID • 2.0k views

ADD COMMENT • link updated 7.9 years ago by EagleEye 7.6k • written 7.9 years ago by bisansamara ▴ 20

1

Entering edit mode

Some gene names have more IDs and vice versa. There is no way around it. One way to deal with it is to keep only one.

ADD REPLY • link 7.9 years ago by Benn 8.4k

score 0 · Answer 1 · 2017-06-14

0

Entering edit mode

7.9 years ago

EagleEye 7.6k

Simple solution would be, C: Transcript Id conversion

C: go term analysis with ensembl gene id

ADD COMMENT • link 7.9 years ago by EagleEye 7.6k