grep pattern in both start and end, but with mismatch in the middle of sequence
0
0
Entering edit mode
2.2 years ago

Hi everyone

I'm looking for a method to perform grep in both start and end of a sequencing, but allowing one, two or three mismatch in the middle.

[conservative match] + match with maximum of three mismatch + [conservative match]

I tried grep from seqkit, but this tool allow mismatch in the entire sequence.

Best regards, Willian

grep mismatch • 1.1k views
ADD COMMENT
1
Entering edit mode

This may be what you are looking for: https://unix.stackexchange.com/questions/163726/limit-grep-context-to-n-characters-on-line

If these are fastq files then some creative combination of restrictright= and restrictleft= may work with bbduk.sh in filter mode. A guide is available here: https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbduk-guide/

ADD REPLY

Login before adding your answer.

Traffic: 2121 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6