Hi.
I need to identify the restriction sites in a cDNA sequence stored in a FASTA file. First, I read the sequence from the FASTA file into a string, and I have created a dictionary that contains the recognition sequences and names of all the restriction enzymes.
I want to identify these restriction sites using the dictionary. How can I do that?
Thank you.
Do your restriction sites have different lengths?
Yes, the sequence contains restriction sites of different lengths.
Then this is really hard to do with a direct dictionary lookup, because you don't know how many bases of your cDNA string to use as the key (the sites have different lengths). I would vote for trying something similar to what Torst is suggesting.
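One way around the key-length problem is to go the other way round: instead of slicing the cDNA and looking each slice up, loop over the dictionary and search the sequence for each recognition site. Here is a minimal sketch in Python. It assumes the dictionary maps recognition sequence to enzyme name, that sites contain only plain A/C/G/T (no degenerate bases like N or R), and that only the given strand is searched. The function name `find_sites` and the example data are made up for illustration.

```python
def find_sites(sequence, enzymes):
    """Return {enzyme_name: [0-based start positions]} for every site found.

    `enzymes` is assumed to map recognition sequence -> enzyme name.
    """
    sequence = sequence.upper()
    hits = {}
    for site, name in enzymes.items():
        site = site.upper()
        positions = []
        start = sequence.find(site)
        while start != -1:
            positions.append(start)
            # Step forward by one base so overlapping sites are also reported.
            start = sequence.find(site, start + 1)
        if positions:
            hits[name] = positions
    return hits


# Hypothetical example data: three common enzymes with sites of different lengths.
enzymes = {"GAATTC": "EcoRI", "GGATCC": "BamHI", "GCGGCCGC": "NotI"}
cdna = "ATGGAATTCCGGATCCTTGAATTCAA"

print(find_sites(cdna, enzymes))
```

Because each site is searched for with its own length, the different lengths stop being a problem. For degenerate recognition sequences you would need to turn each site into a regular expression first, and for non-palindromic sites you would also have to search the reverse complement.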
Thank you for your answer.