Is there a way to compute a distance between two PWMs using a substitution matrix? (for genes, prots or arbitrary alphabets doesn't matter)
Say I have PWM1 corresponding to a Motif in a certain set
PWM2 is the same for another dataset
I have a substitution matrix of the alphabet I am using
-------->
What are the possibilities to compute a distance between those two?
This would be the way if I had not a substitution matrix. I really cannot see how to take into account those similarities between characters since all these methods actually assume that every character is only similar to itself, or am I wrong?
You could use your substitution matrix to weight the differences in PWM1-PWM2. Alternatively, have a look at the method in this paper: Pape UJ, Rahmann S, Vingron M. Natural similarity measures between position frequency matrices with an application to clustering. Bioinformatics. 2008 Feb 1;24(3):350-7.
I'll have a read, thank you!