Hi,
I previously assembled a fungal genome with Illumina Hi-Seq paired-end sequences. The assembly was ~ 32Mbp and was made up of ~ 400 contigs. I did not try to join the contigs into scaffolds. BUSCO determined that the assembly was ~98% complete based on the number of orthologs.
However, I just received PacBio sequences from the same isolates and want to use them improve the assembly and possibly close the genome. My question is asking if I should reassemble the Illumina reads and PacBio reads together using SPAdes or some other hybrid assembler, or if I should update the pre-existing Illumina assembly with PacBio using PBJelly?
Thanks in advance! Morgan
I don't know if you were able to solve your problem by now, but I'll never stop to cite that article : https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5100563/
They are presenting different strategy of assembly according to their coverage and also present a tool you may give a try to : quickmerge (able to merge two different kind of assembly to improve your results).
Notify me if this informations was of any use to you :)
I'm recently also stumbled upon this quickmerge tool and and am currently applying it to my data, I must say I'm a fan in the meanwhile , it runs really fast and the results are more than OK.
Glad to hear I'm not the only quickmerge fan :)
Add me to that list!
If you can hack around the scripts or compute the deltas using MUMmer 4, it becomes even more insanely fast.
On that line, you can also try CAMSA
interesting comment harish , I haven't gotten that deep in it yet. Care to share some insgiht on how to achieve the switch to MUMmer4 ?
EDIT: is it as easy as to point it to the location of the MUMmer4 binaries?
thx for the tip, will look in to it
Try everything (short summary of a much longer thing posted by @h.mon below) you can.