Hi,
I want to compare the accuracy of different trimming tools. I want to search for adapter sequence in the trimmed fastq file. I tried using grep command as mentioned below, but it's looking only for exact match. When I searched with part of the adapter sequence, I can see more results, and all the matches are near the sequence end, hence, I think they are most likely from adapter sequence.
$ zcat sample1_tool1_trimmed.fastq | grep "AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC" | wc -l
$ 28
$ zcat sample1_tool1_trimmed.fastq | grep "AGATCGGAAGAGCA" | wc -l
$ 143
Is there any tool that can search for adapter sequence in fastq file while allowing mismatch? Also, this is one time analysis, hence, it's completely fine if the tool takes more time to run.
And, thank you for your suggestion in advance. :)
see Filter a large fasta or fastq by a query sequence, with parameterized fuzzy matching