Hi folks,
We got long reads sequenced from 10 bacteria using Pac Bio sequencing platform. 5 of them don't have reference bacterial strains and 5 of them have some bacterial strain closer to the subject.
I have to identify anti microbial resistant genes from these 10 bacteria. This is the first time, I am handling PacBio sequence.
Any assembler to handle long reads?As of now don't know the coverage of the sample. Guide me through a reference article if you have encountered for this requirement. I found HGAP from PacBio sequencing platform (https://github.com/PacificBiosciences/Bioinformatics-Training/wiki/HGAP#implementations).
Celera® Assembler link is broken
Check this recent review (Table 2 has lists of lots of useful programs).
There are numerous long-read assemblers available. Many listed here.
https://academic.oup.com/bib/advance-article/doi/10.1093/bib/bbx147/4590140
Do you only have PacBio data? You should get some Illumina:
On stuck records and indel errors; or “stop publishing bad genomes”
As of now, I have been told that I am going to get only the PacBio long reads. Why do you say that I should get some Illumina?
From the blog post I linked:
PacBio and Nanopore have insertions / deletions as main error, Illumina doesn't have many insertions / deletions, so you can correct PacBio errors using Illumina reads.