Hello all,
I was wondering if there are any recommendations on how to make Kraken2
work with Oxford Nanopore reads? It seems like errors are still very much influencing the accuracy of the assignment. So, what do people usually do? Use corrected reads? Make databases in different way?
Thank you in advance!
What kind of taxonomic assignment errors are you observing? I would expect the longer sequences would contain enough correct kmers - even with higher error rates - to allow generally correct taxonomic classification. The literature seems to indicate this as well, e.g.:
Benchmarking the MinION: Evaluating long reads for microbial profiling
Testing the advantages and disadvantages of short- and long- read eukaryotic metagenomics using simulated reads
However, this is just my "gut feeling", as I never used Kraken2 with long, error-prone reads.
I have a sample sequenced by both Nanopore and paired-end Illumina reads, and the difference is huge. I assumed Illumina is correct, thus I was a bit shocked by the difference.