Question

How to rerun Guppy Basecalling (with both fast5_pass and fast5_fail)

0

Entering edit mode

24 months ago

Ludwig Kian Soon • 0

Hi, I recently started a nanopore sequencing and used the live-basecalling feature in mk1c. It was proceeding very slowly. I want to free up the machine to do more sequencing and am thinking of recovering the raw data in queued reads and basecall it elsewhere, but find out that only a newer version of guppy is available there, and when I tested just a single passed fast5, the output generated is different (even though the model used, the min_qscore and other parameters as much as I am aware of are the same), so my question is:

How I can rerun the basecalling using the newer version of Guppy with the fast5 that were been split into pass and fail?
The GPU has got some memory constraint, I tested that I can still do basecalling without demultiplexing at one go, should I do demultiplexing after basecalling with fast5 recovered from raw or the other way around?
Is there a way to combine the fast5 split into pass and fail into those "raw" fast5?

guppy Nanopore • 2.6k views

ADD COMMENT • link 24 months ago by Ludwig Kian Soon • 0

1

Entering edit mode

For the question about rebasecalling both pass and failed fast5; isn't it just a means of setting the input directory as the place with the both pass and fail fast5 directories and making it look recursively for fast5 data (--recursive)?

I am not sure I understand your question about demultiplexing.

For combining the fail and pass fast5, it can just be combined into a single directory, the only difference between the fast5 files is that the resulting fastq score for the reads were below an average quality threshold. Not sure I understand the question.

ADD REPLY • link 24 months ago by samuel.a.odonnell ▴ 590

0

Entering edit mode

Hi samuel.a.odonnell,

Ah, I see; I didn't know it'd be that easy. When I tested the sequences in the re-basecalled fastq file is slightly shorter than the sequences in originally base-called fastq (even though I am using the same model), I thought the fast5 moved into the pass/fail folder underwent some sort of processing. Reading back at the manual, I see now that's not the case.

Regarding the demultiplexing part, I am doing sequencing 24 samples with the barcoding kit, and there's an option in guppy_basecaller to automatically split the basecalled data to different folder according to the barcodes. Just looked at the command line parameters, I think it comes after the guppy_basecaller.

Thanks for your help!

ADD REPLY • link 24 months ago by Ludwig Kian Soon • 0