Extract Ensembl IDs column using a data file
5.7 years ago

Hi,

I have a normalized data file with gene symbols in the first column, followed by columns of normalized values for each sample. I would like to look up the Ensembl ID for each gene symbol and keep both the gene symbols and the Ensembl IDs in the data file for downstream analysis. Please assist me with this.

Current format

gene_symbol Sample_1 Sample_2

Revised format

gene_symbol Ensembl_ID Sample_1 Sample_2

Best Regards, Toufiq

annotations ensembl ID RNA-Seq • 1.6k views
5.7 years ago
c.chakraborty ▴ 180

Use biomaRt from Bioconductor to extract the Ensembl IDs of your genes: https://www.bioconductor.org/packages/devel/bioc/vignettes/biomaRt/inst/doc/biomaRt.html

5.7 years ago
Erin Haskell ▴ 470

BioMart is definitely the way to go. You can use the Bioconductor R package biomaRt, as c.chakraborty suggests, or the point-and-click web tool. We have a video covering ID conversion that you may find helpful.
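If you go the web-tool route, you still need to join the exported table back onto your data file. Here is a minimal Python sketch of that step, not a definitive implementation. It assumes both files are tab-separated, that the BioMart export has the gene symbol in its first column and the Ensembl gene ID in its second, and that the data file has the gene symbol in its first column. The function names, the `Ensembl_ID` header, the `NA` placeholder, and the toy sample values are all made up for illustration.

```python
import csv
import io


def load_symbol_map(mapping_file):
    """Read a two-column TSV (gene symbol, Ensembl gene ID) into a dict.

    One symbol can map to several Ensembl IDs, so each value is a list.
    """
    symbol_to_ids = {}
    reader = csv.reader(mapping_file, delimiter="\t")
    next(reader, None)  # skip the header row, whatever its column names are
    for row in reader:
        if len(row) < 2 or not row[0] or not row[1]:
            continue  # skip rows without a symbol or without an ID
        ids = symbol_to_ids.setdefault(row[0], [])
        if row[1] not in ids:
            ids.append(row[1])
    return symbol_to_ids


def add_ensembl_column(data_file, out_file, symbol_to_ids, missing="NA"):
    """Copy the data file, inserting an Ensembl ID column after the symbol."""
    reader = csv.reader(data_file, delimiter="\t")
    writer = csv.writer(out_file, delimiter="\t", lineterminator="\n")
    header = next(reader)
    writer.writerow([header[0], "Ensembl_ID"] + header[1:])
    for row in reader:
        if not row:
            continue  # ignore blank lines
        ids = symbol_to_ids.get(row[0])
        # Multiple IDs for one symbol are joined with ';'
        ensembl = ";".join(ids) if ids else missing
        writer.writerow([row[0], ensembl] + row[1:])


# Toy example with in-memory files; replace with open("...") on real files.
mapping = io.StringIO(
    "Gene name\tGene stable ID\n"
    "TP53\tENSG00000141510\n"
    "BRCA1\tENSG00000012048\n"
)
data = io.StringIO(
    "gene_symbol\tSample_1\tSample_2\n"
    "TP53\t5.1\t4.8\n"
    "BRCA1\t2.3\t2.9\n"
)
result = io.StringIO()
add_ensembl_column(data, result, load_symbol_map(mapping))
print(result.getvalue(), end="")
```

Symbols that are not found in the mapping get `NA` rather than being dropped, so the output keeps the same rows as the input and you can see which genes need a manual check (old aliases are a common cause).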


Thank you. This was helpful.
