Control gene size for gene enrichment study
0
1
Entering edit mode
8.7 years ago
michealsmith ▴ 800

I would like to test if genes containing at least one transcription factor (say MEF2A) binding sites are enriched for certain category.

I could easily come up with a TF-containing-gene list by intersecting TF binding sites bed files with gene annotation bed files, and send for enrichment study.

But question is: if one gene is big, naturally it tends to be more likely to contain TF binding sites. So should I first control gene size?

So I should normalize by assigning one parameter to each gene as: (overlap size)/(gene size) ? And then sort and select say the top 200 or 500?

gene enrichment GO • 1.6k views
ADD COMMENT
1
Entering edit mode

Is the transcription factor more likely to be biologically relevant when bound to promoters? If that's true, you could just restrict the overlap to TSS+/- 1kb which would generate fragments of equal length.

ADD REPLY
0
Entering edit mode

No, the TF bind to everywhere, which are all biologically relevant.

ADD REPLY

Login before adding your answer.

Traffic: 2399 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6