Entering edit mode
12.5 years ago
Davy
▴
410
Hi All,
So I'm trying to basically get a list of all Entrezgenes and their corresponding gene names, and transcript ids. I've tried biomart ensembl and UCSC genome browser but had no luck with either. Ensembl gives me lots of missing rows of data, and ucsc doesn't appear to offer any entrez ids at all. Anyone know where I could download, or query a table of all entrezgenes?
Excuse my ignorance, but any suggestions for the best way to deal with these files? I was looking into parsing them with python, but that seems very DIY for something that should be relatively routine.
I would use the ASN1 files, convert it to XML using asn2xml ( http://www.ncbi.nlm.nih.gov/IEB/ToolBox/XML/ncbixml.txt ) and parse it with a (python-based) sax parser