DMRs, how common are they and how long?
1
0
Entering edit mode
2.7 years ago
malonzm1 ▴ 20

Hi!

Does anyone know how common differentially methylated regions (DMRs) (or just methylated regions) are? Say, in a 10,000bp region how many DMRs can one expect?

Also, does anyone know about the distribution of the size/length of DMRs, i.e. how many CpGs are usually in a DMR? Or the usual range? E.g. is a region consisting of just 2 CpGs can be considered a methylated region?

Thanks and good day!

regions methylated methylation DMR differentially • 1.4k views
ADD COMMENT
1
Entering edit mode
2.7 years ago

That's like asking "how many differentially expressed genes do you expect?" That question cannot be answered without additional context, i.e. if I'm comparing cancer cells vs. normal cells, I expect a lot more differentially expressed genes than when comparing technical replicates of normal cells to each other.

GC content and CpG islands and so on are notoriously species-specific -- in mammalian genomes, if you look at random 10KB regions in gene-poor loci, you're going to find much fewer CpG clusters than when looking in 10KB regions covering multiple gene loci.

What exactly is it you're after? I.e. why are you asking these questions in the first place?

ADD COMMENT
0
Entering edit mode

Thanks. Perhaps it's more apt to use the term CpG clusters than methylation regions. In this case, the conditions are not relevant. Even a vague range would help.

I'd like to compare methods that detect methylated regions. Instead of using simulated data, I'd like to use actual bisulfite sequencing data. Knowing the method/s that return the most realistic number of methylated regions and the most realistic sizes would help in the comparison.

ADD REPLY
0
Entering edit mode

CpG islands in mammalian genomes are typically hypomethylated. I.e. detecting clusters of CpGs will not necessarily translate into detecting "methylated regions". What type of method do you have in mind anyway? Methylation itself is usually detected via bisulfite sequencing, i.e. the chemical treatment of the DNA. Are you trying to see how accurate the distinction between unmethylated CpGs (= 0% reads) vs. fully methylated CpGs (= 100% reads) are?

ADD REPLY
0
Entering edit mode

Thanks. I'm looking at an approach to detecting methylated regions that measures the correlation of methylation levels of neighboring CpG sites (CpG sites meeting a certain threshold of correlation are combined into a methylated region). However, this approach returns relatively short regions. This is why I'd like to know what the typical range of sizes of methylated regions are. I'd also like to know the range of how common methylated regions are to compare with the output of this method.

ADD REPLY
0
Entering edit mode

Maybe run another tool that does "methylation region detection" on your data and see what comes up?

ADD REPLY
0
Entering edit mode

Thanks, but how will I know how accurate this other tool is if I don't have an estimate of the "ground truth"?

ADD REPLY
0
Entering edit mode

You need to define "ground truth". What it is that you want to test? What is it that your tool is trying to solve?

ADD REPLY
0
Entering edit mode

Ground truth here would be the range of lengths of DMRs.

ADD REPLY

Login before adding your answer.

Traffic: 2899 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6