How can we identify if there are neighboring genes or overlapping promoter regions within the 1000 bp upstream of the transcription start site (TSS) that we have defined?
6 weeks ago
Cluster • 0

How can we identify whether there are neighboring genes or overlapping promoter regions within the 1000 bp upstream of the transcription start site (TSS) that we have defined? Is it a problem that other promoter regions might fall inside this window when we aim to discover novel motifs involved in myogenesis?

We aim to identify transcription factor binding sites (TFBSs) in the promoter regions of genes across 40 hierarchical clusters, defining the promoter as the 1000 bp upstream of the transcription start site (TSS). Should we limit our analysis to genes whose nearest upstream neighboring gene on the opposite strand is more than 1000 bp away, to avoid confusion with TFBSs that belong to neighboring genes unrelated to myogenesis, or is this not an issue? And if we do have to apply this limit, how can we identify such genes in a large set of promoter sequences covering more than 1500 genes?

TSS TFBS • 538 views
6 weeks ago
Ming Tommy Tang ★ 4.3k

You may want to look into the ChIPseeker Bioconductor package and, more generally, the GenomicRanges package.

6 weeks ago

I'd recommend bedtools for exploratory interval-based work: https://bedtools.readthedocs.io/en/latest/content/example-usage.html
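
To make the interval logic concrete, here is a minimal pure-Python sketch of the check the question describes: build a strand-aware 1000 bp upstream window for each gene, then list any other gene whose body or own promoter window overlaps it. The `Gene` tuple, the function names, and the toy coordinates are all made up for illustration, and BED-style coordinates (0-based start, exclusive end) are assumed. It is not how bedtools or GenomicRanges work internally, and for a full genome annotation those tools will be much faster than this all-against-all loop.

```python
from collections import defaultdict, namedtuple

# Hypothetical record type; coordinates are BED-style (0-based start, exclusive end).
Gene = namedtuple("Gene", "name chrom start end strand")


def promoter_window(gene, upstream=1000):
    """Return the (start, end) interval covering `upstream` bp 5' of the TSS."""
    if gene.strand == "+":
        # TSS is at gene.start; upstream means lower coordinates.
        return max(0, gene.start - upstream), gene.start
    # Minus strand: TSS is at the gene's end; upstream means higher coordinates.
    return gene.end, gene.end + upstream


def overlaps(a, b):
    """True if two half-open intervals share at least one base."""
    return a[0] < b[1] and b[0] < a[1]


def find_promoter_conflicts(genes, upstream=1000):
    """For each gene, list other genes whose body or promoter overlaps its promoter."""
    by_chrom = defaultdict(list)
    for g in genes:
        by_chrom[g.chrom].append(g)

    conflicts = {}
    for g in genes:
        window = promoter_window(g, upstream)
        body_hits, promoter_hits = [], []
        for other in by_chrom[g.chrom]:
            if other.name == g.name:
                continue
            if overlaps(window, (other.start, other.end)):
                body_hits.append(other.name)
            if overlaps(window, promoter_window(other, upstream)):
                promoter_hits.append(other.name)
        conflicts[g.name] = {"gene_body": body_hits, "promoter": promoter_hits}
    return conflicts


# Toy annotation: geneA and geneB form a divergent (head-to-head) pair.
genes = [
    Gene("geneA", "chr1", 5000, 8000, "+"),
    Gene("geneB", "chr1", 3000, 4500, "-"),
    Gene("geneC", "chr1", 20000, 22000, "+"),
    Gene("geneD", "chr2", 4200, 4800, "+"),
]

conflicts = find_promoter_conflicts(genes)
# Genes whose promoter window is free of any neighboring gene body or promoter.
clean = [name for name, hits in conflicts.items()
         if not hits["gene_body"] and not hits["promoter"]]
print(clean)
```

In practice the neighbors should come from the full genome annotation rather than only the 1500 genes of interest, since an unrelated gene outside the clusters can still sit inside a promoter window.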

deepTools might also come in handy for checking read distributions from bigWig files for ChIP-seq etc.: https://deeptools.readthedocs.io/en/develop/
