Just a general question: often, when I run something like MEME to search for de novo motifs in my data, the top few motifs I get back look like the following:
Motif 1 regular expression
--------------------------------------------------------------------------------
T[GC]T[GC]T[GC]T[GC]T[GC]T[GC]TGT[GC]T[GC]T[GC]T[GC]T
Is there an appropriate way to scan for enriched de novo motifs while avoiding repetitive, low-complexity hits like this one? I realize some TFs have consensus sequences of this kind, but I am assuming most do not. Thoughts?
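To make "repetitive" concrete, here is a minimal Python sketch of what I mean. It only uses the regular expression from the MEME output above; the helper distinct_kmer_fraction, the 2-mer score, and both example sequences are my own illustration (assumptions, not anything MEME produces or uses), so treat it as a rough way to flag this kind of hit rather than a proper low-complexity filter.

```python
import re

# Regular expression copied verbatim from the MEME output above.
MOTIF_RE = re.compile(r"T[GC]T[GC]T[GC]T[GC]T[GC]T[GC]TGT[GC]T[GC]T[GC]T[GC]T")


def distinct_kmer_fraction(seq, k=2):
    """Hypothetical complexity score: distinct k-mers divided by total k-mers.

    Values near 0 mean the sequence reuses the same few k-mers (repetitive);
    values closer to 1 mean most k-mers are different.
    """
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    return len(set(kmers)) / len(kmers) if kmers else 0.0


# Made-up example sequences, both 23 nt (the width of the motif above).
repeat_seq = "TG" * 11 + "T"              # pure TG dinucleotide repeat
mixed_seq = "ACGTTGCAAGCTTAGGCATCGAT"     # arbitrary non-repetitive sequence

for name, seq in [("repeat", repeat_seq), ("mixed", mixed_seq)]:
    matches_motif = MOTIF_RE.fullmatch(seq) is not None
    score = distinct_kmer_fraction(seq)
    print(f"{name}: matches motif = {matches_motif}, 2-mer score = {score:.2f}")
```

A plain TG repeat satisfies the motif exactly and scores very low, while an ordinary-looking sequence does not match and scores much higher. Any cutoff on a score like this would be arbitrary and would need checking against real data.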
Thanks.