How to extract reads which contain specific sequence from fastq file
0
1
Entering edit mode
5.6 years ago
hsu ▴ 40

Hi, I wanna extract some reads which contain my interest sequence information from fastq file, but these sequence information I interested is in the middle of a read or start of a read. How could I extract these reads?

Thank you!

fastq reads extract • 6.2k views
ADD COMMENT
3
Entering edit mode

If not only looking for exact matches, you can try seqkit grep: seqkit grep --by-seq --max-mismatch 1 --pattern "ATCGAAG" test.fq

ADD REPLY
2
Entering edit mode

Use bbduk.sh from BBMap suite in filter mode. User guide here. Add literal=sequence_you_are_looking.

ADD REPLY
1
Entering edit mode

What are these sequences, and why not extract them with the grep function?

ADD REPLY
0
Entering edit mode

Are you are looking for an exact match of your specific sequence? What have you tried?

ADD REPLY

Login before adding your answer.

Traffic: 1882 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6