Searching for taxonomic sequence classifier tools that assigns taxonomic labels to reads, where the input is long reads (PacBio, ONT .....).
There is already tool that could work with long read (centrifuge
). But I would like to know if there are any other tools beside centrifuge?
Thanks
kraken? Although, with PacBio you might have a problem with erroneous k-mers.
Kraken supports short read as far as I can tell, beside that Centrifuge is faster
I don't think long reads would be an issue, it would just treat it as single-end fastq. When I look at Table2 in the centrifuge paper, kraken is about twice as fast as centrifuge, albeit with a much worse memory requirement (and of course the database creation takes ages).
Do not forget that long read comes with high error rate (15% for PacBio as example).
That's what I wrote in my first comment. I don't know about ONT, but of course Pacbio will introduce erroneous kmers due to their indel error model.