Entering edit mode
9.4 years ago
Shicheng Guo
★
9.6k
Dear Guys,
Can you share a perl/R/Python/Java script for CpGI, CpG Shore, CpG shelf regions bed file preparation?
Thanks
I tried to write a perl code to subtract 2000bp and add 2000 to the CpG island region to get CpG shore regions (55436 CpG shores) [Note: I removed chrUn, random and hap regions from hg19 genomes]. and then use bedtools subtract to remove CpG shores overlapped with CpG island (48790) and then merge the CpG shores which are overlapped within CpG shore regions (46817 CpG shore). Eventually, 46817 CpG shore region were obtained. and you can download this annotation: hg19.cpgshoreExt.txt
Similar idea, we can obtain genome regions for CpG shelf regions. 93634 regions were obtain from CpG shore files. 44844 remains after remove regions overlapping with CpG island, 41934 regions remained after remove regions overlapping with CpG shores and then 40812 regions remained after merge within CpG shelf. hg19.cpgshelfExt.bed can be downloaded.
Please elaborate on your requirement, and also give us some details on what you've tried.
Check this solution too,