From GEO omnibus I have download raw intensities from a 450k methylation array (in a tab-delimited text format). However, there are duplicates of some of the probes:
example for Sample 1:
There are two rows for this probe, one for Address A and one for Address B:
- cg00000622 (Address = 11642304)
- cg00000622 (Address = 38691301)
However, each has a methylated and an unmethylated intensity
- methylated = 907 unmethylated = 10835
- methylated = 120 unmethylated = 735
How do I get this down to single methylated and unmethylated values for this probe? (so that I may calculate the beta value).
Is this a type I probe or is one of them a control probe?
Thanks for any help
This is a two-color array. You should get a background towards that platform first before diving into analysis, e.g. https://www.bioconductor.org/help/course-materials/2015/BioC2015/methylation450k.html
Thank you. But how does one arrive at a single intensity value for methylated and a single intensity value for unmethylated, for type I probes? For type II probes, the methylated is in the red channel, unmethylated in green. For type I, am I to interpret the red ("methylated") channel as having both a methylated and unmethylated signals? It's a bit confusing.
If red and green do not equate to methylated and unmethylated for type I probes, how does one calculate the beta value from this? Do you add both methylated signals = M, and add both unmethylated signals = U?