I am currently analyzing the TCGA HNSC methylation data. I have split the probes to each gene, but I find out that for one gene, there will be several probes near that gene. I am wondering how could I get the methylation level to represent for that gene if there are mutli_probes near that gene.
I have read TCGA tutorial or other papers, they said that for genes with multi methylation probes, the probe with most anti-correlated is selected to represent the methylation level of a gene. But I am confused that, for methylation, we mostly care about the methylation in the promoter area. Shouldn't I only focus on the probes near the promoter area? So I am thinking about maybe I should get an average value for the probes near the promoter area to represent the methylation level of that gene. Is that Right?