Hi everybody,
To perform some analyses, I need to extract the coordinates of all CG dinucleotides from the human genome (hg19). Writing such a script would not be very difficult but, for several reasons, I would prefer to rely on existing tools or processed datasets.
However, I was surprised to find nothing on Google or in data repositories such as UCSC.
Do you know of such a tool or a downloadable dataset?
Thanks in advance for your answers.
Thanks for the code, Pierre. I coded something similar (in Python), but I was more interested in a published solution to compare against.
I can upload the code on figshare if needed :-P
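For anyone who wants a starting point in the meantime, here is a minimal Python sketch of this kind of scan. It is neither Pierre's script nor the code mentioned above, only an illustration under a few assumptions: the genome is a plain (uncompressed) FASTA file, coordinates are reported as 0-based half-open intervals as in BED, matching is case-insensitive so that soft-masked (lowercase) sequence is included, and only the forward strand is reported. The function name `iter_cg_sites` and the file name `hg19.fa` are hypothetical.

```python
import sys


def iter_cg_sites(fasta_handle):
    """Yield (chrom, start, end) for every CG dinucleotide in a FASTA stream.

    Coordinates are 0-based, half-open (BED-style); this is an assumption,
    not a convention stated in the thread.
    """
    chrom = None
    offset = 0   # bases of the current sequence consumed before this line
    prev = ""    # last base of the previous sequence line
    for line in fasta_handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # New sequence: keep the first word of the header as its name.
            chrom = line[1:].split()[0]
            offset = 0
            prev = ""
            continue
        seq = line.upper()  # treat soft-masked (lowercase) bases as normal
        # A CG can straddle two FASTA lines: C ends one, G starts the next.
        if prev == "C" and seq[0] == "G":
            yield chrom, offset - 1, offset + 1
        pos = seq.find("CG")
        while pos != -1:
            yield chrom, offset + pos, offset + pos + 2
            pos = seq.find("CG", pos + 1)
        offset += len(seq)
        prev = seq[-1]


if __name__ == "__main__":
    # Hypothetical usage: python cg_sites.py hg19.fa > hg19_CG.bed
    with open(sys.argv[1]) as handle:
        for chrom, start, end in iter_cg_sites(handle):
            print(chrom, start, end, sep="\t")
```

Reading line by line keeps memory use low even for whole chromosomes, at the cost of the extra check for a CG split across a line break.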
Hi Philippe and Pierre,
Did you ever find a solution to this? I am also very interested in finding all the CpG sites in the human genome.
-Diviya
Great script, Pierre.