Question

DNA methylation for certain region

0

Entering edit mode

7.1 years ago

Learner ▴ 280

I would like to know your opinion on finding methylation from a data set of HumanMethylation450. I have read few papers and R codes. I have a data with only one replicate (healthy against disease)

Is there any way to find whether methylation occurred or not? what threshold normally one should use to see if it is methylated or not (I mean the same prob)

genome • 1.7k views

ADD COMMENT • link updated 7.1 years ago by Kevin Blighe 88k • written 7.1 years ago by Learner ▴ 280

score 0 · Answer 1 · 2017-10-08

0

Entering edit mode

7.1 years ago

Kevin Blighe 88k

Hey Learner,

If you have β values, then it is generally accepted to conduct a Wilcoxon Signed Rank test on a probe-wise basis, and to then adjust the derived P values by FDR. The Wilcoxon test is a non-parametric test for comparing related or paired samples. It can be used in situations like, for example, comparing tumour versus normals (not necessarily matched). Also, aim to determine the difference between the mean β in the healthy samples and that in the disease samples.

The general cut-off for statistical significance should then be:

Passes 5% FDR
Greater than difference in means of absolute 0.15 (>|0.15|)

These can obviously be modified in term of stringency.

Kevin

ADD COMMENT • link 7.1 years ago by Kevin Blighe 88k

0

Entering edit mode

@Kevin Blighe thanks for your explanation. what do you mean by beta? if we have a prob, we have the expression for that do you mean I perform a linear relation between disease versus normal and get beta+xalpha ? can you please explain it a bit more thanks

ADD REPLY • link 7.1 years ago by Learner ▴ 280

0

Entering edit mode

No, the Beta value is a measure of the level of methylation at the probe site. Take a look here: Interpretation of Beta values : Methylation data

I am aware that 'beta' is also used in regression modeling, but I am not referring to that here.

Which values have you got? I would not use the term 'expression' for methylation, as it does not make sense. Methylation at a genomic locus actually represses transcription/expression at the locus in question.

ADD REPLY • link 7.1 years ago by Kevin Blighe 88k

0

Entering edit mode

@Kevin Blighe do you have any reference for the cut-off ? Also what if I want to see the methylation in general ? for example on disease probs and on healthy probs , is there any way that I see the differences ?

ADD REPLY • link 7.1 years ago by Learner ▴ 280

0

Entering edit mode

Hey, they allude to 0.15 as a threshold HERE. You'll see references to it scattered throughout the literature. I guess that it's a bit like 5% FDR and log2 fold-change of 2 in expression studies, which are what people generally go for but that are by no means fixed thresholds.

Yes, you can also of course just plot out the Beta values and visually compare. I did this once and 'randomly' selected 0.6 as a cut-off for high/low methylation.

ADD REPLY • link 7.1 years ago by Kevin Blighe 88k

0

Entering edit mode

Hey, I wanted to get back to you on this because I had been reading an interesting manuscript about Beta and M values. The authors concluded that they both exhibit a linear relationship in the range Beta 0.2-0.8, but that, outside of this range, M values are better for parametric statistics due to the fact that their distribution (across the 27k chip and their study samples) was more homoskedastic and therefore more in line with the assumptions of simple statistical tests, like the t-test.

However, in my recommendations to you here, I mentioned the use of a Wilcoxon Signed Rank test, which is non-parametric and therefore justified.

For more, see here: https://www.ncbi.nlm.nih.gov/pubmed/21118553

ADD REPLY • link 7.1 years ago by Kevin Blighe 88k