I am trying to get the count of the matching pattren in the fasta file. i am having the fasta files which contains 57k sequences. i want to pull out the count of my matching pattern sequence. And required the starting position of the pattren
Input file:
How do we figure out what patterns are represented in that table? You are using CAGAT as the original question but it is not obvious in the results table in present form.
Would be better to update the code in the original answer by editing that since that is what people will try to use. This looks better. Can you fix the group names so the actual sequences the pattern is coming from are identified?
Thanks for the response, The link is not opening sir...
Ooops, sorry, try now
is this the way i have to give input
python code.py -f singleline.fasta -m 'CTGCAG'
It should be
It should be fairly well documented how to use it.
Thanks for your response , its working..