So there is my problem : I am searching for the average distance between a known motif inside sequence, and extend this to a list of sequences...
The first part is done, the second part (extend to a list of sequences) is the problematic one ! So, here the way i am doing the first part :
It's appear that the first sequence of my fasta list have an average distance of 152 nucelotide between two TCCA motif.
And, i don't know how automatize this to all my list in df...
The Rho function in the seqinr package calculate the indipendence of occurence of a k-words using the expected word count.
We can modify it to be run just for our target:
Look at apply() and its variants.
Thanks, i will check it !