Hello all,
I have recently received raw FASTQ files for plasmids sequenced using Oxford Nanopore (long reads). The plasmid is around 6500bp of length.
First, I have run QC and found very long reads, much longer than the plasmid size.
Second, when I assembled the reads using Unicycler, I have obtained longer contigs than the plasmid length (which is expected due to the very long reads available).
Third, I have performed pairwise sequence alignment using Clustal Omega and found overlap between the contigs. Kindly check the following link that shows the alignments: https://github.com/abedkurdi/testing_shiny_app/blob/master/clustalo-I20250220-090531-0648-32923810-p1m.aln-clustal_num
I appreciate any guidance in this matter.
Thank you.
I forgot to mention that. I have already blasted the contigs and I am getting the top hits for "Pseudomonas Putida" for all the contigs and in all the samples. Do you think that this could be a contamination that is messing with the data?
Also, I got another batch of samples for other group of people, I also did blast and I am getting Mycoplasma as top hits.
At least the Mycoplasma sounds very much like contamination of the sample to me. Talk to the people who did the sequencing for you.
It could be you still have your plasmid contigs/genes among the contamination - search for them with blast etc
If your data has contamination with non-plasmid DNA then you would need to account for it, potentially prior to assembly. You should be able to bin the non-plasmid reads out.
I have run pLannotate on one of the samples and on the reference plasmid sequence. I noticed that by comparing the outputs, the features in the sample have multiple copies, while in the reference I have one copy per feature. Is that weird? Is it possible that I have "concatemers"?
Was the plasmid isolated before library prep? How was the lib prep done?
The plasmid was extracted using maxiprep kit from Qiagen. Regarding the library preparation, rapid barcoding kit from ONT (SQK.RBK.114.24) was used.
Any insights?
Do you expect the prep to be pure plasmid DNA?
It should be, right?