Entering edit mode
11.3 years ago
user
▴
950
how can one find out what version of Ensembl UCSC is using when you download the ensGene table from UCSC? Is it guaranteed to be the same for each genome? For example once a new genome, say hg19, is released, does it use a fixed version of Ensembl and stick with it even with new Ensembl releases (based on hg19) are made? thanks.
We're currently on Ensembl 72, so that's not the latest version. To get the latest Ensembl data, I would recommend getting it from Ensembl.
I stand corrected. "Data last updated: 2013-05-02", so I guess version 72 is pretty recent and UCSC will update soon.
that's helpful but how do you get it programmatically from goldenPath ftp? if you download http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/ensGene.txt.gz for example on a given date how do you find its associated Ensembl version? I don't want to go through the tables interface with the browser
Well, since version is not recorded in any of the ensGene-related files at the FTP site, the answer is that you cannot get it programmatically. Best you can do is use "wget --timestamping" so as to at least know when the file was updated, then I guess compare that date somehow with dates of Ensembl releases.