Hi,
I have got my first Nanopore sequencing data and the first step was to see if the data is good. Has anyone has any experience with this kind of data and can tell me how to interpret the results.
The whole report can be downloaded here (not sure how to post it here). Allin all it looks quite good to me, but what I'm not sure about are the two images attached here. These are the per base QC and sequencing content. It seems that the beginning of the reads is not good, but this can be probably trimmed by removing adapters etc. But Is it really that i need to remove the first 1000 positions? This seems a bit extreme.
About the sequencing content, I don't know where to begin. This looks consistent with the first image, where the quality is not good, and if removing these positions, it should get better. but is it ok for the two pairs TG and AC to be so much apart like that?
EDIT:
I have done both pycoQC as well as the minionQC run (R). They show similar results. It can be downloaded from here a few of the images are also attached.
To me it looks as if the run was past its prime after ~50h. There is no real gain of new reads afterwards. As I'm usually work with mRNA-Seq, I'm not sure how to call the PHRED Quality. I can see that the most reads are of Q ~=8 and most of them are short (which is also expected). But all in all can this run be classified as good?
thanks
Please use a program that is more suitable for QC'ing Nanopore data. PycoQC (LINK) and Nanoplot (LINK). And then show us what those reports look like.
I have edited the post and added results from pycoQC.
It would help to clarify what kind of data this is. You have an assessment from @Trivas below but in some contexts this may be an expected outcome.