Question

Transcription Factor Enrichment Analysis Using SEA of Meme-Suite

0

Entering edit mode

3.0 years ago

ilovesuperheroes1993 ▴ 40

I would like to run a transcription factor (TF) enrichment analysis using SEA of meme-suite. I have around 2000 query sequences in which I would like to find the enrichment. I have two questions regarding this process:

Should I use shuffled query sequences as control or is it better to provide random DNA sequences as control? To select random sequences, I have generated random genomic coordinates of lengths identical to my query sequences. Is this method correct?
As for the TF motifs, I have used the mononucleotide models (full) from HOCOMOCO database. I wanted to use the dinucleotide models, which are supposed to be more accurate, but only the mononucleotide models have the motifs in meme format.

Any inputs would be appreciated. Thanks

Transcription Enrichment MEME SEA Factor • 1.1k views

ADD COMMENT • link updated 3.0 years ago by Malcolm.Cook ★ 1.5k • written 3.0 years ago by ilovesuperheroes1993 ▴ 40

0

Entering edit mode

what process produced the 2000 sequences in the first place?

ADD REPLY • link 3.0 years ago by Malcolm.Cook ★ 1.5k

0

Entering edit mode

The 2000 query sequences are ChIP Sequencing peaks. I didn't mention it in my original post as it did not seem important to what I was asking.

ADD REPLY • link 3.0 years ago by ilovesuperheroes1993 ▴ 40

0

Entering edit mode

with more info about the upstream experiment and analysis that generated these peaks one might suggest that you

compare "good" (high-scoring) peaks v "bad" (low-scoring) ones.
compare peaks that are differential in your experiment v peaks that don't change
compare sequence under peaks against the same peaks "shifted" some number of bases in the genome (ensuring length profile is preserved)

ADD REPLY • link 3.0 years ago by Malcolm.Cook ★ 1.5k