Entering edit mode
13 months ago
ConvolutedGenome
▴
50
Hi,
I have a list of 100,000+ non-coding regions and I want to find noncoding regions that are recurrently mutated. Now my study is in melanoma, and melanoma tumours are known to be highly mutated at UV signature sites.
This mean that I HAVE to consider this as a potential covariates. The problem is that I don't know how :(
Here's what I'm thinking:
- Count how many UV signature sites I've got in each of these regions.
- Use this UV count as a sort of "feature" for clustering. My hope is that regions with similar UV site counts might behave similarly mutation-wise.
But here's where I'm stuck:
- How do I even find these UV sites? Is there a BED file or something out there that can help me figure out where these UV sites are?
Any help is much appreciated. Thanks you so much!