Changing the Ensembl ID to scientific name, gene name, ensembl ID
0
0
Entering edit mode
5.4 years ago
Nata Ali • 0

I downloaded 100s of gene orthologs from ENSEMBL using FASTA format. I have headers that look a lot like this one:

>EMLSAP00000005133

or this one:

>ENN75927

I would like my headers to have the following format:

>Scientific_name_geneName_EnsemblID

Based on my research, I've seen that using the prefixes from the ensembl IDs, I should be able to find the scientific names. I have also seen that I could use BiomaRt in order to find the gene name. However, I am having trouble using BiomaRt in the command line and being able to automatically transform the hundreds of headers from the different files.

Could someone help me? Is there a way of doing all of this (scientific name and gene name) in the same step or is this approach the correct one?

gene • 978 views
ADD COMMENT

Login before adding your answer.

Traffic: 1658 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6