I have a set of SNPs that are significantly associated with metabolite levels. In order to understand the enrichment pattern in ncRNA i used Fisher's exact test in R. I obtained the following results:
p-value = 0.0149
OR = 0.2248
CI = [0.0271, 0.8180]
Looking at the OR, ncRNA is depleted in my case. I would like to understand how the p-value and CI influence the interpretation? Since the CI does not include a 1, and the p-value is bellow a significance level - does this mean that ncRNA is SIGNIFICANTLY depleted? Or something entirely different?
I have other examples, for e.g. miRNA:
p-value = 0.7808
OR = 0.5851
CI= [0.0706, 2.1277]
Based on the OR, miRNA is also depleted. In contrast to ncRNA, its p-value is above a significance level and the CI includes a 1. Does this in turn mean, that miRNA is NOT significantly depleted, but is yet depleted? Could it be that the p-value and CIs should not be the focus of the interpretation, but only the OR should?
Thanks a lot for your time!
Thank you very much for this!
I'd like to know what is the connection between ncRNA being short and the "true" OR being somewhere between [0.0271, 0.8180] ? Can you please explain that.
Under the null hypothesis there is no connection between a SNP being in an ncRNA and it being connected to metabolite levels - SNPs are randomly distributed across the ncRNA and the "not-ncRNA" classes and any apparent pattern is just chance. However, because ncRNAs are short, if you scatter SNPs randomly they are very unlikely to fall in ncRNAs.