Hi I want to extract sequences after a motif say "TTTTTAAAAA" from a multifasta file. I do not want the nucleotides before this keyword. Is it possible to extract nucleotides between 2 motifs with grep? eg. nucleotides between TTTTTAAAA & AAAATTTT. I tried with grep but I need the fasta headers also. Can anybody suggest a solution in grep (if possible) or perl or python.
thanx raghul
You can get a case with the motif found several times within a same sequence. How do you want to deal with that?
Hello!, I would like to do something similar...did you find a way to complete your task?