Entering edit mode
6.3 years ago
Gene_MMP8
▴
240
I have been analyzing breast cancer mutations from COSMIC. One of the observations I made was, for cancer-causing mutations, there is an abundance of thymine nucleotides immediately adjacent(left side) to the mutation site. How can I validate this result? Can anyone point me to some published resource/tool that can help me prove the over-representation of certain bases flanking the mutation sites?
That's pretty granular information. Are you sure it's not confirmation bias? I think you're going to need a custom script for that.
Can you please elaborate? I have already tested that the abundance is statistically significant with a p value <10e-11. Are you implying on making a custom script that tests whether the abundance is statistically relevant?
Perhaps you need to do this: Take a) different cancer dataset(s) and/or b) a non-cancer dataset(s). Repeat the analysis. See if the correlation does not exist in those. Especially if you are looking to correlate it specifically to breast cancer.
I was indeed suggesting that you test its statistical significance, which you've already done.