Hello,
I'm wondering how to get the start and the end positions of all promoters in all human chromosomes?
Thanks
At Ensembl we've annotated promoters as part of our regulatory build (shiny new paper on it). These are based on segmentation data from ENCODE and RoadMap Epigenomics, finding consensus regions of promoter activity between cell types. This will be further refined as we add more cell types to the analysis (e.g. more from ENCODE and RoadMap and add in Blueprint).
You can access these annotations through the Ensembl Browser (here is one at the 5' end of a gene, where we expect it to be), BioMart (e.g. this query will get you all the predicted promoters on chromosome 21), the Ensembl APIs and the Ensembl FTP site.
I do recommend that people read the paper and/or the documentation on how these were predicted. We don't have assayed evidence showing that these promoters will lead to gene expression when stuck on the end of a gene. What we have are ChIP-seq, DNase and similar data that are indicative of promoter activity, i.e. it's very likely that if you stick one on the end of a gene with the right TFs the gene will be expressed, but no-one's tested to see if that's true.
What is your definition of a promoter? A DNA motif, specific chromatin states, or e.g. a region 1000bp upstream of the TSS?
Why does a promoter have more than one definition?
You are right, per se it's the place where expression is regulated, and mostly it is proximal to the transcription start site. But it can be gene- and tissue-dependent and more complex. So, one possibility would be to take the region from -1000bp relative to the gene start up to TSS -1. Or you can go further and use the ENCODE data https://www.encodeproject.org/. They tried to find promoters using chromatin states, modified histones or proteins that interact with DNA specifically at promoters.
If, for example, the start of a specific gene is 1345000, can we say that the promoter starts at 1344000 and ends at 1345000 for this gene?
No, you can only say that the promoter is probably within the segment.
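To make the window arithmetic concrete, here is a minimal Python sketch of the "-1000bp to TSS -1" heuristic discussed above. The function name `upstream_window`, the 1-based inclusive coordinates and the +1/-1 strand encoding are my own assumptions for illustration, not part of any Ensembl or ENCODE tool. Note that for a gene on the reverse strand the TSS is the gene's highest coordinate and "upstream" means larger coordinates. The result is only a candidate segment in which the promoter probably lies, not the promoter itself.

```python
from typing import NamedTuple


class Window(NamedTuple):
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def upstream_window(tss: int, strand: int, size: int = 1000) -> Window:
    """Return the `size` bp immediately upstream of a TSS.

    Hypothetical helper: assumes 1-based inclusive coordinates and
    strand given as +1 (forward) or -1 (reverse).
    """
    if strand not in (1, -1):
        raise ValueError("strand must be +1 or -1")
    if strand == 1:
        # Forward strand: upstream means smaller coordinates.
        # The start is clamped to 1; chromosome length is not checked.
        return Window(max(1, tss - size), tss - 1)
    # Reverse strand: upstream means larger coordinates.
    return Window(tss + 1, tss + size)


# Forward-strand gene starting at 1345000, as in the question above.
fwd = upstream_window(1345000, strand=1)
# Same coordinate treated as the TSS of a reverse-strand gene.
rev = upstream_window(1345000, strand=-1)
print(fwd, rev)
```

With a forward-strand TSS at 1345000 the window is 1344000 to 1344999, so the TSS base itself is excluded; on the reverse strand it would be 1345001 to 1346000.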
Many thanks Jimbou, very useful information.