Tool: HFKReads, an ultra-fast and efficient tool for extracting high-frequency k-mer reads from sequencing data
9 months ago
Huiyang ▴ 190
  1. How to extract the sequences of organelle genomes from whole-genome sequencing data?
  2. How to extract high-abundance microbial sequences from metagenomic sequencing data?
  3. How to extract the sequences of highly expressed genes from transcriptomic sequencing data?

    We developed HFKReads, a tool for rapidly extracting high-frequency k-mer reads from sequencing data. To evaluate its performance and efficiency, we extracted organelle genome reads from plant whole-genome sequencing data. Approximately 95-99% of the nuclear genome sequences were removed, and 1 Gb of reads could be extracted in less than 1 minute using a single thread.
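    The post does not describe how HFKReads works internally, so the following is only a minimal Python sketch of the general idea of high-frequency k-mer read extraction, not the HFKReads implementation. The function names and the parameters `k`, `min_count`, and `min_fraction` are hypothetical, chosen for illustration. It assumes all reads fit in memory and ignores reverse complements, which a real tool would likely handle.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every k-mer of seq (uppercased), skipping k-mers that contain N."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if "N" not in kmer:
            yield kmer


def count_kmers(reads, k):
    """Count how often each k-mer occurs across all reads."""
    counts = Counter()
    for read in reads:
        counts.update(kmers(read, k))
    return counts


def extract_high_frequency_reads(reads, k=5, min_count=3, min_fraction=0.5):
    """Keep reads in which at least min_fraction of the k-mers occur
    min_count or more times in the whole dataset.

    All thresholds are illustrative assumptions, not HFKReads defaults.
    """
    counts = count_kmers(reads, k)
    kept = []
    for read in reads:
        read_kmers = list(kmers(read, k))
        if not read_kmers:
            continue  # read shorter than k, or made up entirely of N-containing k-mers
        high = sum(1 for km in read_kmers if counts[km] >= min_count)
        if high / len(read_kmers) >= min_fraction:
            kept.append(read)
    return kept


# Toy data: five copies of a high-copy (organelle-like) read and one
# single-copy (nuclear-like) read that shares no 5-mer with it.
reads = ["ACGTACGTAC"] * 5 + ["TTGACCAGGT"]
kept = extract_high_frequency_reads(reads, k=5, min_count=3, min_fraction=0.5)
```

    In this toy example the five repeated reads are kept and the single-copy read is dropped, which mirrors why high-copy organelle reads can be separated from low-coverage nuclear reads in whole-genome data.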

    Using the extracted reads, organelle genome assembly was 10-20 times faster. Assembly quality also improved markedly, because most organelle-derived sequences that had been inserted into the nuclear genome were excluded. We hope this software will be of valuable assistance to your research.

The HFKReads code and manual are available on GitHub.
