Entering edit mode
2.2 years ago
l.willianpacheco
▴
10
Hi everyone
I'm looking for a method to perform grep in both start and end of a sequencing, but allowing one, two or three mismatch in the middle.
[conservative match] + match with maximum of three mismatch + [conservative match]
I tried grep from seqkit, but this tool allow mismatch in the entire sequence.
Best regards, Willian
This may be what you are looking for: https://unix.stackexchange.com/questions/163726/limit-grep-context-to-n-characters-on-line
If these are fastq files then some creative combination of
restrictright=
andrestrictleft=
may work withbbduk.sh
in filter mode. A guide is available here: https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbduk-guide/