Hi all.
I would like to know the hardware requirements for de novo assembly for a relatively small plant genome (around 800 Mb). I have seen older posts but with the new assemblers and the improvements they have had in the last few years, I think the run times and RAM requirements may have improved. Until now, I have considered several de novo assembly algorithms for PacBio data: Canu, Flye, SMARTdenovo, wtdgb2. Do you have any idea how much time-consuming will be to try different pipelines with those algorithms to obtain the best assembly? I will also use other tools like Quickmerge or Pilon to correct the assembly with the reference (or assemblies generated by other algorithms).
Additionally, I would like to know your opinions about the coverage that the sequencing service has offered to us. They offered around 15x with Pacbio HiFi. However, according to what I have read, the minimum recommended coverage is usually around 20x. The plant has a reference genome and they will also sequence our samples with high-quality short fragments from Illumina. Could problems arise due to the coverage suggested by the sequencing service? The purpose of the study is to get a better understanding of different varieties of the same plant. Thank you in advance.
Hi Peter. Thank you for your response! I think we can not get more coverage, at least, for now, due to our budget. But, in case it is strictly necessary we could ask for more. That is the reason why I was considering the use of Quickmerge with the draft assembly and the reference genome published. Additionally, we will get Illumina reads to correct the assembly. Do you think that the low coverage of PacBio reads could still be a problem?