I have a number of DNA sequences. Each sequence is either up or downstream of an SNP. How can I compute the presence of transcription start site from it? Thanks for your insight.
I have a number of DNA sequences. Each sequence is either up or downstream of an SNP. How can I compute the presence of transcription start site from it? Thanks for your insight.
You could map your sequences to a model genome assembly (e.g., hg38
for human genome, etc.) with a tool like Bowtie2, which recovers intervals where those sequences map, and associate them with genomic annotations like Gencode, Ensembl, CAGE, etc. using a tool like BEDOPS bedmap
, which can give you strand and offsets from which to derive the position of the TSS within the original input sequence.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
What is the organism your working on....??
Welcome to biology, where we do appropriate experiments to deduce features like this, and where not everything can simply be computed.
Experimentally speaking, can I use an already identified dataset of TSS containing its locations and accordingly check if it falls within the range of my dna sequences?
Yes - but ideally you would use TSS determined in your tissue of interest as this level of regulation can be dependent on the cell type.
I'm interested in the brain tissues. Which folder here contains that information?