What sequencing coverage do you have for the PacBio reads? Are they from the RSII or Sequel?
What sequencing coverage do you have for the Illumina paired end reads?
What are the insert sizes for the mate pair libraries?
Facing some of the same issues myself these days and looking forward to read the responses. Currently we are using the following pipeline (eukaryotic genome):
1. Correct PacBio reads with LoRDEC using the Illumina reads
2. Assemble the corrected PacBio reads using Canu
3. Running Spades on the Illumina reads toghether with the Canu assembly as trusted contigs
4. Polishing the assembly with Pilon
you can give MaSuRCA a try perhaps?
or Canu with PacBio, any short read assembler on the illumina and combine/scaffold both afterwards with something like MeDuSa or such?
My best results were using Pacbio corrected by Canu and a hybrid assembly using the cleaned data from illumina and the corrected Pacbio in the Masurca software.
IMPORTANT! Do not use third party tools to pre-process the Illumina data before providing it to MaSuRCA, unless you are absolutely sure you know exactly what the preprocessing tool does. Do not do any trimming, cleaning or error correction. This will likely deteriorate the assembly.
What sequencing coverage do you have for the PacBio reads? Are they from the RSII or Sequel? What sequencing coverage do you have for the Illumina paired end reads? What are the insert sizes for the mate pair libraries?
Thanks for your answer jean.
PacBio data: Sequel and coverage 20X
Illumina data: Total coverage 50X and Insert size: 350, 550, 700 for PE and 3000, 5000 for MP.