Tool:Fast, flexible grep for very large FASTQ files
2
4
Entering edit mode
11 days ago
milesforjazz ▴ 40

Got a very large FASTQ file that you want to quickly filter for the presence or absence of multiple target sequences?

I'm happy to release v.1.1.4 of grepq, a fast and flexible way of grepping large FASTQ files.

Free, open-source and built in Rust.

Crates.io

https://crates.io/crates/grepq

GitHub

https://github.com/Rbfinch/grepq

grepq grep-like FASTQ • 571 views
ADD COMMENT
0
Entering edit mode

Might be worth adding a few of the tidbits from your README that emphasize the benefits of your tool over typical grep.

ADD REPLY
0
Entering edit mode
8 days ago
milesforjazz ▴ 40

grepq v1.1.6 is more than 1000 times faster than the venerable grep, and 11 times faster than ripgrep. Both grep and ripgrep, being general purpose tools, will search the entire file, including the quality scores. Occassionally, targets will 'lurk' in quality scores, leading to false positives.

grepq also has a tune subcommand to analyze matched target strings and update the number and/or order of regex patterns in your pattern file according to their matched frequency. This can further speed up the filtering process.

Check out the tool and README here:

https://crates.io/crates/grepq

If you've downloaded a previous version, updating is as simple as:

cargo install grepq

ADD COMMENT
0
Entering edit mode
4 days ago
milesforjazz ▴ 40

grepq v.1.1.9 released today, bringing JSON support for pattern file input and tune subcommand output, allowing named regex sets and named regex patterns. gzip support was added in v.1.1.8

Check out grepq and README here:

https://crates.io/crates/grepq

ADD COMMENT

Login before adding your answer.

Traffic: 2005 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6