Hi everyone.
I have a bed file with N specified loci. Is it possible to look up what gene corresponds to a given loci in the bed file?
If yes, how can I do the request to the ucsc server not downloading whole genome?
Many thanks.
Hi everyone.
I have a bed file with N specified loci. Is it possible to look up what gene corresponds to a given loci in the bed file?
If yes, how can I do the request to the ucsc server not downloading whole genome?
Many thanks.
I assume you want all genes in a given locus. First, download the latest release of the appropriate GTF. You don't need the binary genome. This is what you can do using bash shell:
You can't convert that. A GTF doesn't have the actual sequences in it. It's just an annotation file with gene names, coordinates, etc. You can either the latest release for the annotations:
ftp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/
or the earliest:
ftp://ftp.ensembl.org/pub/release-55/gtf/homo_sapiens/
Note that these are from ensembl and the instruction were meant for that. You can get something from UCSC over here: http://genome.ucsc.edu/cgi-bin/hgTables?command=start where you select the hg19 assembly, genes and gene predictions, region genome, track whatever names you want, and select GTF in the output format.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
How to identify which genes fall within genomic locations
N Closest Genes To A Given Location