Entering edit mode
6.0 years ago
lakhujanivijay
5.9k
I am trying to extract gene descriptions using Ensembl gene ids for candida glabrata using Ensembl BioMart website, however , when I select the species from the drop down
I encounter following error
Error link can be accessed here
What I have with me are the Ensembl ids like these
CAGL0L12386g
CAGL0M04477g
CAGL0M04851g
CAGL0M06193g
CAGL0M12452g
CAGL0M13299r
ENSRNAG049947011
ENSRNAG049947025
ENSRNAG049947039
ENSRNAG049947051
ENSRNAG049947068
ENSRNAG049947077
And I am interested in the corresponding gene descriptions. Any help will be appreciated.
Also tagging, Emily_Ensembl
Are those real Ensembl ID's? I tried 2-3 and can't seem to pull up any results via search.
Note 1: If you search at https://fungi.ensembl.org then you get hits.
Note 2: BioMart at https://fungi.ensembl.org does not have the species you need.
@Emily: REST API is not enabled for Fungi?
NCBI unix Utils gets the
CAG*
ids but not theENS*
since they are Ensembl specific.You can get fungi through the Ensembl Genomes REST API.
Certain species and strains in Ensembl Fungi are not available in BioMart – this occurs when we imported the data directly from INSDC.
I tried a few of those IDs and the search worked for me.
Thanks Emily_Ensembl
Could you please correct the API link which appears broken/incorrect?
Apart from the REST API, there is no other way you mean? I am not familiar with this. Also, generally the description is there in the GTF file, I have seen it seen it several times; I don't know why it's not there in this case in the GTF file.
Because the genome comes from the INSDC import, the description is only there if it was there in the original data in INSDC. We just might not have it – it wouldn't be available by REST either.
@Vijay: That link is just http://rest.ensemblgenomes.org
My source of GTF file was this and indeed these ids are there
That search looks like you're searching your genome for the name of your species. That's not going to work.