Extremely high ChIP-seq peak at mouse Chr2 "Gm10800" region
16 months ago
Ri ▴ 30

I am working with some CUT&Tag/ChIP-seq data (for a TF).

After mapping and peak calling, I used IGV to look at the peak distribution across the genome and found that a great number of peaks/reads map to a specific region of chr2.

In gencode.vM21.primary_assembly.annotation.gtf, this region is described as "Gm10800".

I couldn't find much information about Gm10800, so I'm wondering whether this signal is reasonable, and if so, why?

Thanks in advance!

IGV image

16 months ago
ATpoint 86k

It overlaps a known artifact region that consistently accumulates excessive reads in NGS experiments. You can get blacklists for various genomes from https://github.com/Boyle-Lab/Blacklist/tree/master/lists, which are based on this paper (https://www.nature.com/articles/s41598-019-45839-z). I always remove any peaks overlapping the blacklist in my preprocessing pipeline to get rid of these artifact regions in my peak sets.
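
For illustration, here is a minimal sketch of that filtering step in plain Python. It is not the exact pipeline described above: the function names `parse_bed` and `remove_blacklisted` and the toy coordinates are made up for the example, and it assumes both the peaks and the blacklist are in BED format (tab-separated, 0-based, half-open intervals) with matching chromosome names.

```python
def parse_bed(lines):
    """Parse BED-formatted lines into (chrom, start, end) tuples.

    Hypothetical helper for this sketch; only the first three columns are used.
    """
    intervals = []
    for line in lines:
        line = line.strip()
        # Skip blank lines and common BED header lines
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        intervals.append((fields[0], int(fields[1]), int(fields[2])))
    return intervals


def remove_blacklisted(peaks, blacklist):
    """Return only the peaks that do not overlap any blacklist interval."""
    # Group blacklist intervals by chromosome so each peak is only
    # compared against intervals on its own chromosome
    by_chrom = {}
    for chrom, start, end in blacklist:
        by_chrom.setdefault(chrom, []).append((start, end))

    kept = []
    for chrom, start, end in peaks:
        # Two half-open intervals overlap when each starts before the other ends
        overlaps = any(
            start < b_end and b_start < end
            for b_start, b_end in by_chrom.get(chrom, [])
        )
        if not overlaps:
            kept.append((chrom, start, end))
    return kept


if __name__ == "__main__":
    # Toy coordinates, NOT the real Gm10800 or blacklist positions
    peaks = parse_bed(["chr2\t100\t200", "chr2\t500\t600", "chr3\t100\t200"])
    blacklist = parse_bed(["chr2\t150\t300"])
    print(remove_blacklisted(peaks, blacklist))
    # prints [('chr2', 500, 600), ('chr3', 100, 200)]
```

In practice the same result is usually obtained with an interval tool, e.g. `bedtools intersect -v -a peaks.bed -b blacklist.bed` (file names here are placeholders), which reports only the peaks that have no overlap with the blacklist.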
