Hi guys. I'm beginner of bioinformatics.
In order to evaluate binding property of a compound which can selectively bind short DNA sequence in genome, I want to locate short sequence, for example "5'-GAATT-3'", from fasta file (genome) and make a heat map counting the number of the short sequence in windows. How should I do?
I'm not good at ruby or perl, and I'm glad if you give me solution in python, R or Linux command line.
Hi ,
If i resume you want to : -step 1 : "grep" all GAATT you have in you genome fasta -step 2 : note the coordinate of motif matching ? -step 3 : make a plot of you motif along your genome fasta ?