I am using IlluminaHumanMethylation450kanno.ilmn12.hg19
package to obtain the SNP-associated probe information (probe_rs, probe_maf).
The reference manual says
In addition to the SNP information provided by Illumina, we have added independent information on the overlap of the 450k with various versions of dbSNP. The overlap is based on the mappings of the array to the hg19 genome provided by Illumina. As dbSNP we have used the ‘Common’ table from UCSC (ie. ‘snp137Common’).
I also found Infinium HD Methylation SNP List from the Illumina webpage
But I am still not sure how the SNP-associated probe (or overlap) is defined.
Would it be something like SNPs within xx bp of the CpG site?
Any information would be appreciated.
How is full capture sequence defined?
The capture sequence is the genomic sequence that encapsulated the CpG site, and attaches to the oligonucleotide bead on the array.
Thank you very much
thanks, what does maf 0.05 imply? why should these CpGs be removed?
Minor Allele Frequency (MAF) - Measured differences in methylation could actually be a source of genetic variation in your population, thus if you want to minimise that bias then this function removes those probes.