How to find if a gene symbol is enriched in treatment vs control?
0
0
Entering edit mode
2.5 years ago
coboyfan12 • 0

I assume there would be a simple statistical test to perform this, but I can't find anything while searching.

For example:

My dataset has SNP variants that are associated with genes for n=25 "cancer" and n=25 "normal". If a gene has a variant the gene symbol is listed. So if patient five had 7 mutations in ATF4, that ATF4 would appear 7 times in the dataset for that instance.

I'm wondering if there is a statistical test or package in R to determine which genes are more "enriched" in each group.

R • 713 views
ADD COMMENT
1
Entering edit mode

What about genes that have no variant? You could look into Fisher Exact tests, but you'll need a way to define the universe of genes, i.e. all genes that were interrogated.

ADD REPLY
0
Entering edit mode

I could attain the list of all genes interrogated. But I'm not sure with Fisher Exact Tests how I would attain the specific genes that were enriched in each group?

ADD REPLY
1
Entering edit mode

well, you'd have

  • Mutation status: yes/no
  • Sample status: cancer/normal

Based on your description I felt you should be able to retrieve the number of genes for each one of those four categories.

One pesky detail may be whether to count unique instances of gene names (e.g. your 7 mutations of ATF4 would give only a count of +1 since it's all for the same gene) or not.

ADD REPLY

Login before adding your answer.

Traffic: 2933 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6