Question

Ideal data amounts and read depth generated by MinION sequencer

5.8 years ago

xusijiamed • 0

Hey guys, I'm new to nanopore sequencing and have many basic questions, so I'd really appreciate your knowledge. I used a MinION to sequence a library containing DNA molecules mostly about 10 kbp long. The flowcell I used is an old one, with about 700 active pores according to the platform check. I ran the device for about 4 hrs and got 15 Gb of fast5 and 1.5 Gb of fastq data. I read that a flowcell can be used for 48 hrs or more and ideally generates 10~30 Gb of data (does that mean the fast5 data?). This time, with half the number of active pores and a 4-hr run, I got 15 Gb of data?? BTW, the quality control by MinIONQC shows that the quality scores of most reads are >7, which looks acceptable...

1. Is this normal?
2. If my purpose is to confirm some pathogenic variants in one target gene, how much read depth is usually needed? I think that affects the sequencing time. This time I chose to run for 4 hrs, but without any real basis for it.

Could anyone give some suggestions? Thank you very much!

nanopore sequencing MinION fast5 data amount • 4.9k views

updated 5.8 years ago by WouterDeCoster 47k • written 5.8 years ago by xusijiamed • 0

score 1 · Answer 1 · 2019-02-28

A better place for these questions might be the nanopore community forum, assuming you have access. This is also not exactly bioinformatics, and in that sense SEQanswers could be more appropriate, but I don't know for sure whether you'll get an answer there.

Anyway.

I used a MinION to sequence a library containing DNA molecules mostly about 10 kbp long

Did you use enrichment, or is this just genome sequencing? If the latter, what's the genome size?

The flowcell I used is an old one, with about 700 active pores according to the platform check.

That's indeed a suboptimal flowcell, but might be enough to generate data for your experiment.

I ran the device for about 4hrs

Any reason why you stopped then? Unless you use barcodes or wash the flow cell, you're better off keeping one sample per flow cell (to avoid contamination).

and got 15 Gb of fast5 and 1.5 Gb of fastq data. I read that a flowcell can be used for 48 hrs or more and ideally generates 10~30 Gb of data (does that mean the fast5 data?).

No, now you are confusing gigabytes with gigabases. The size of your files (in bytes) is not important. The amount of data, expressed in (giga)bases is what you should look at.
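To make the gigabytes versus gigabases distinction concrete, here is a minimal Python sketch that counts the bases in a FASTQ file instead of looking at its size on disk. It assumes standard 4-line FASTQ records (no wrapped sequence lines); the function names `count_bases` and `open_fastq` are made up for this illustration and are not part of any ONT tool. As a rough sanity check, an uncompressed FASTQ stores one quality character per base, so it holds somewhat fewer than half as many bases as it has bytes.

```python
import gzip
import io


def count_bases(handle):
    """Return (number_of_reads, total_bases) for a 4-line-per-record FASTQ text stream."""
    reads = 0
    bases = 0
    for i, line in enumerate(handle):
        if i % 4 == 1:  # the second line of each record is the sequence
            reads += 1
            bases += len(line.strip())
    return reads, bases


def open_fastq(path):
    """Open a plain or gzip-compressed FASTQ file as text (hypothetical helper)."""
    if path.endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


if __name__ == "__main__":
    # Tiny in-memory example; for a real run use: with open_fastq(path) as fh: ...
    demo = io.StringIO("@read1\nACGTACGT\n+\n!!!!!!!!\n@read2\nACGT\n+\n!!!!\n")
    n_reads, n_bases = count_bases(demo)
    print(f"{n_reads} reads, {n_bases} bases, {n_bases / 1e9:.9f} Gbases")
```

The number to compare against the quoted 10~30 Gb per flowcell is the total in bases, not the file size.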

the quality control by MinIONQC shows that the quality scores of most reads are >7, which looks acceptable... 1. Is this normal?

A mean read quality of 7 is ONT's standard cut-off for considering reads "good" or "not good", but depending on your application low-quality reads may also be valuable.
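For reference, this is roughly how a per-read mean quality can be computed and compared against that cut-off. The sketch assumes Phred+33 encoded quality strings and averages the error probabilities rather than the Phred scores themselves, which, as far as I know, is how ONT's basecallers report mean read quality. The names `mean_read_quality` and `passes_filter` are hypothetical.

```python
import math


def mean_read_quality(quality_string, offset=33):
    """Mean Phred quality of a read, averaged in error-probability space.

    Assumes Phred+33 encoding; returns 0.0 for an empty quality string.
    """
    probs = [10 ** (-(ord(c) - offset) / 10) for c in quality_string]
    if not probs:
        return 0.0
    return -10 * math.log10(sum(probs) / len(probs))


def passes_filter(quality_string, threshold=7.0):
    """True if the read's mean quality reaches the threshold (7 is the cut-off discussed above)."""
    return mean_read_quality(quality_string) >= threshold


if __name__ == "__main__":
    print(round(mean_read_quality("(((("), 2))  # every base is Q7
    print(passes_filter("!I"))  # one Q0 base and one Q40 base
```

Averaging probabilities means a few very poor bases pull the read's score down much more than a plain average of the Phred values would suggest.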

If my purpose is to confirm some pathogenic variants in one target gene, how much read depth is usually needed?

If you want to confirm variants which you expect to be there, then I would consider about 20x coverage to be enough. If you are doing de novo variant calling without knowing what to expect, you would need more.
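To translate a coverage target into a sequencing yield, mean depth is simply the on-target bases divided by the size of the target. The sketch below is only back-of-the-envelope arithmetic: the function names, the `on_target_fraction` parameter, and the 5 Mb example size are all illustrative assumptions, not values from this thread.

```python
def mean_depth(total_bases, target_size, on_target_fraction=1.0):
    """Expected mean depth: on-target bases divided by target size (both in bases)."""
    return total_bases * on_target_fraction / target_size


def bases_needed(target_depth, target_size, on_target_fraction=1.0):
    """Total yield in bases needed to reach target_depth over a region of target_size bases."""
    return target_depth * target_size / on_target_fraction


if __name__ == "__main__":
    # Hypothetical example: 20x over a 5 Mb genome with no enrichment.
    needed = bases_needed(20, 5_000_000)
    print(f"{needed / 1e9:.2f} Gbases needed")
    # Depth actually reached if the run yields exactly that much.
    print(mean_depth(needed, 5_000_000))
```

With whole-genome sequencing the target size is the genome size, which is why the genome size and whether enrichment was used matter for deciding how long to run.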