Entering edit mode
4.0 years ago
Morris_Chair
▴
370
Hello guys,
I want to find out which of about 500 genes of my interest have a defined consensus sequence in the promoter, I want thinking to create a bed file and look the peaks with IGV but that's gonna take forever, is there any faster way ?
thank you :)
Hi ATpoint,
I do have the PFM but how can I extract the promoter of all the genes? or how can I tell to FIMO to scan the genes of my interest?
thanks
You could get the TSS coordinates for the relevant genes from a GTF file and then simply take a window upstream of it, say 500bp which is often use as a proxy for the promoter. The coordinates then can be converted to fasta with
bedtools getfasta
, which then can go to fimo.Hi ATpoint, I used this used defined function for getting the the TSS , it seems pretty much straightforward to use
tss <- getTSSinUCSCFormatFromEnsemblGTF("Homo_sapiens.GRCh38.95.gtf.gz")
I assume I got many TSS for single gene which should be the IRanges, is it correct?
doing this promoter(), I can have a list of the promoters range, I think I can use this ranges directly without making TSS windows, what do you think?
thanks