Question

Get MACS2 peaks associated with specific genes

4.9 years ago

ahmed.saadawi • 0

Hi guys,

I have an ATAC-seq dataset of three conditions, each with three replicates. After running MACS2 within a custom pipeline, I obtained peak files (narrowPeak and summits, size filtered) for the individual replicates and for the merged ones. Now I would like to get the peak information (.bed and .fasta files) associated with specific genes. The aim is to use those peaks for motif discovery and enrichment, and to correlate transcription factors with genes of interest. For that, I will use the MEME suite, in particular CentriMo, and will also try HOMER. My question is: how do I get the .bed and .fasta files of accessible peaks associated with such genes?

Any advice on downstream motif analysis would be greatly appreciated.

Thanks a lot in advance!

ATAC-Seq Macs2 Peaks • 1.7k views

ADD COMMENT • link updated 4.8 years ago by Biostar 20 • written 4.9 years ago by ahmed.saadawi • 0

score 0 · Answer 1 · 2020-01-06

4.9 years ago

ATpoint 85k

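narrowPeak files are in fact in BED format (the first columns are chromosome, start and end), so they can be used directly as BED input. FASTA files can be obtained from them with bedtools getfasta. Restricting the peaks to certain genes can be done with bedtools intersect, provided you have a BED file with the gene (or any other genomic) positions.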
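To make the intersect step concrete, here is a minimal pure-Python sketch of what it does logically: keep every peak that overlaps at least one gene interval. This is an illustration only, not how bedtools is implemented; the function names and the toy peaks and gene are made up for the example. With bedtools itself the equivalent would be something along the lines of `bedtools intersect -u -a peaks.narrowPeak -b genes.bed` (file names are placeholders).

```python
def parse_bed(lines):
    """Yield (chrom, start, end, fields) from BED-like lines, skipping headers."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        yield fields[0], int(fields[1]), int(fields[2]), fields


def peaks_overlapping_genes(peak_lines, gene_lines):
    """Return peak records (as field lists) that overlap at least one gene."""
    genes_by_chrom = {}
    for chrom, start, end, _ in parse_bed(gene_lines):
        genes_by_chrom.setdefault(chrom, []).append((start, end))

    hits = []
    for chrom, start, end, fields in parse_bed(peak_lines):
        # Half-open BED intervals overlap when each starts before the other ends.
        if any(start < g_end and g_start < end
               for g_start, g_end in genes_by_chrom.get(chrom, [])):
            hits.append(fields)
    return hits


# Toy narrowPeak-style records and one gene (all values invented).
peaks = [
    "chr1\t100\t200\tpeak_1\t50\t.\t5.0\t10.0\t8.0\t40",
    "chr1\t900\t1000\tpeak_2\t30\t.\t3.0\t6.0\t4.0\t55",
    "chr2\t100\t200\tpeak_3\t70\t.\t7.0\t12.0\t9.0\t60",
]
genes = ["chr1\t150\t500\tGeneA\t0\t+"]

selected = peaks_overlapping_genes(peaks, genes)
for fields in selected:
    print("\t".join(fields[:4]))
```

The linear scan per peak is fine for a handful of genes of interest; for genome-wide annotations bedtools (or an interval tree) is the better choice.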

ADD COMMENT • link 4.9 years ago by ATpoint 85k

Thanks a lot! One more question: should I only consider the exact coordinates of the genes, or should I add some flanking regions (e.g., 5 kb upstream and 1 kb downstream) to improve TF motif discovery and enrichment? I don't know if that would make sense, though!

ADD REPLY • link 4.9 years ago by ahmed.saadawi • 0

For HOMER: you just provide the coordinates of the peaks. For MEME: you need to extract the FASTA sequences yourself, with a custom script or with bedtools.
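As a rough idea of what such a custom script could look like, the sketch below slices peak intervals out of a genome held in memory and emits FASTA text. It assumes 0-based, half-open BED coordinates and a genome small enough to load whole; the function names and the toy sequence are invented for the example. For real genomes, `bedtools getfasta -fi genome.fa -bed peaks.bed -fo peaks.fa` (placeholder file names) is the more practical route.

```python
def read_fasta(lines):
    """Parse FASTA lines into {sequence_name: sequence}."""
    genome, name, chunks = {}, None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                genome[name] = "".join(chunks)
            # Keep only the first word of the header as the sequence name.
            name, chunks = line[1:].split()[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        genome[name] = "".join(chunks)
    return genome


def peaks_to_fasta(peaks, genome):
    """Return FASTA text for (chrom, start, end) peaks, named chrom:start-end."""
    records = []
    for chrom, start, end in peaks:
        # BED coordinates are 0-based and half-open, matching Python slicing.
        records.append(f">{chrom}:{start}-{end}\n{genome[chrom][start:end]}")
    return "\n".join(records) + "\n"


# Toy genome and peaks (invented for illustration).
toy_genome = read_fasta([">chr1 toy", "ACGTACGTAC", "GGGGCCCCAA"])
fasta_text = peaks_to_fasta([("chr1", 2, 6), ("chr1", 10, 14)], toy_genome)
print(fasta_text, end="")
```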

For proximal targets: ideally, you should use ChIP-seq data for H3K27ac, H3K4me3, or other histone modification marks to define potential promoters, enhancers, and other cis-regulatory elements.

For distal targets: consider taking Hi-C data (promoter-enhancer interactions) into account.

ADD REPLY • link 4.9 years ago by BenHu • 0

It depends on what you mean by gene. If you mean the promoter, I would take ATAC peaks that overlap something like -500 bp to +50 bp relative to the TSS of annotated genes.
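A promoter window like this has to be strand-aware, since "upstream" means lower coordinates on the plus strand and higher coordinates on the minus strand. The sketch below derives such windows from gene records; the function names are made up, and the exact off-by-one convention (TSS as a 0-based position, window length of upstream + downstream bases) is an assumption you may want to adapt. The resulting intervals can then serve as the gene BED file for the intersect step.

```python
def tss_from_gene(start, end, strand):
    """Return the 0-based TSS position of a gene given as a half-open BED interval."""
    return start if strand == "+" else end - 1


def promoter_window(chrom, tss, strand, upstream=500, downstream=50):
    """Return a half-open (chrom, start, end) window around the TSS, strand-aware."""
    if strand == "+":
        start, end = tss - upstream, tss + downstream
    elif strand == "-":
        # Mirror the window: upstream lies at higher coordinates on the minus strand.
        start, end = tss - downstream + 1, tss + upstream + 1
    else:
        raise ValueError(f"unknown strand: {strand!r}")
    # Clamp at the chromosome start; clamping at the chromosome end is omitted here.
    return chrom, max(0, start), end


# Toy genes (invented): (chrom, start, end, name, strand).
toy_genes = [
    ("chr1", 1000, 5000, "GeneA", "+"),
    ("chr1", 1000, 5000, "GeneB", "-"),
]

windows = {}
for chrom, start, end, name, strand in toy_genes:
    tss = tss_from_gene(start, end, strand)
    windows[name] = promoter_window(chrom, tss, strand)
    print(name, *windows[name], sep="\t")
```

Passing larger `upstream` and `downstream` values gives wider TSS-centred windows if you want to include more flanking sequence.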

ADD REPLY • link 4.9 years ago by ATpoint 85k