Hi all!
I want to merge consensus motif to degenerate motif as below,
– consensus motif
ACGGGTA
GCGGGTC
ACGGGTC
GCGGGTA
– degenerate motif
RCGGGTM
{G|A}CGGGT{A|C}
I googled it but didn't find the answer.
Can you tell me some algorithm or principle of it?
For example , I have a list of consensus motifs
motif1 ACGGGTA
motif2 GCGGGTC
motif3 ACGGGTC
motif4 GCGGGTA
motif5 AGGGGTC
motif6 GCGGGTT
The degenerate motif can be
method 1.
motif 1-motif4 , motif6 RCGGGTH
motif 5 AGGGGTC
method 2.
motif 2 , motif5 ASGGGTC
motif 1 , motif 3-4 , motif 6 RCGGGTH
Which method is right?
Thanks.
EDIT : My question come from using K-mer to search DNA seqence , I want to clustering consensus motif to degenerate motif like the example above.
Your edit did not change the question, my solution does produce the degenerative sequence with ambiguity codes as you requested. If you have something else in mind when you say "clustering" you will have to elaborate on that and make it clear (or create a new post if you want something other than what you indicted above).
"clustering" - I mean "How similar motifs can merged together ?" I'm sorry that my question is not clear. As you wish, I marking the question answered. Thanks for your help.