Hi Everyone,
How can I obtain a bed file with the TSS region? I am performing a chipseq analysis and to see the read covrage around the TSS using deeptools , I would like to have the hg38 TSS region .bed/gtf file. I googled a lot for this and found good solution for the hg19 assembly. It was hard for me to find the right solution for hg38 assembly. I kindly request someone to guide me to get the required files.
Thank you.
Satish
Thanks Devon, I will try this approach. I would also like to ask what kind of .bed file should I be exactly having to know the read coverage intensity of my chip seq around TSS. I am working chipseq on FOXO transcription factor.
You want at minimum a 6 column BED file, with entries for each transcript. There are easier ways of making that than what I showed to you, but if you wanted a BED12 file for future use (e.g., the next version of deepTools) then you'd already be good to go with it.
Devon, can you please provide two more information
Why two dollar signs used in $$3 != "gene" instead of just $3. I am quite curious to know whether $$ have any different meaning.
Also,if I am to get a bed file with TSS regions alone , could you please suggest any tweaks required with the given script. (I tried the given script, the output bed seemed to have lot more information than just TSS co-ordinates).
This was originally being run inside of a Makefile, so it needed to be escaped properly. That wouldn't be needed otherwise. I probably should have noted that when I posted this originally.
Devon, Thanks a lot for clarifying.
Regarding the other query, is there a robust way that you would recommend to get a bed/gtf file for TSS alone?
If you have BED file then it's a bit easier. It'd be something like:
For a GTF file it'd be something like:
If there are comment lines then they'd need to be removed, either with grep or directly in awk.
Thanks Devon. The two one liners provided would be of great help to me indeed.
Jf