Best tools for genome assembly with paired-end, mate-pair and linked-read (10x) data

0

6.1 years ago

hirad.alipanah ▴ 10

Hi everyone. We have the following table for our data: Data Table

We want to assemble the genome of the planarian Dugesia japonica. The genome is approximately 1.5 Gbp, diploid and highly repetitive (similar to Smed, i.e. Schmidtea mediterranea). Which assembler would you suggest for assembling all of this data?

genome assembly Assembly • 2.1k views

ADD COMMENT • link 6.1 years ago by hirad.alipanah ▴ 10

0

Could you clarify which species you want to assemble?

ADD REPLY • link 6.1 years ago by Nicolas Rosewick 11k

0

Yes, the planarian Dugesia japonica.

ADD REPLY • link 6.1 years ago by hirad.alipanah ▴ 10

1

Could you edit your question to add this information, plus the expected genome size, ploidy, etc.? Thanks.

ADD REPLY • link 6.1 years ago by Nicolas Rosewick 11k

0

10x data may need to be handled separately; Supernova is what you would want to use there. I see some data from a GAII, which leads me to believe that you have collected different data over time. Are all these datasets from exactly the same sample/organism?
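As a rough sketch of what handling the 10x library separately with Supernova could look like, the Python snippet below only builds the two command lines (assembly, then FASTA export) and prints them; it does not run anything. The run ID, FASTQ directory, core count and memory limit are hypothetical placeholders, and the options should be checked against the Supernova version actually installed.

```python
import shlex


def supernova_commands(run_id, fastq_dir, cores=32, mem_gb=450, maxreads="all"):
    """Build (not run) the Supernova assembly and FASTA-export commands.

    All argument values are illustrative assumptions, not recommendations.
    """
    run = [
        "supernova", "run",
        f"--id={run_id}",            # name of the output directory
        f"--fastqs={fastq_dir}",     # directory holding the 10x FASTQ files
        f"--maxreads={maxreads}",    # cap on input reads ("all" = no downsampling)
        f"--localcores={cores}",     # CPU limit for the local run
        f"--localmem={mem_gb}",      # memory limit in GB
    ]
    mkoutput = [
        "supernova", "mkoutput",
        "--style=pseudohap",                  # one pseudo-haplotype FASTA
        f"--asmdir={run_id}/outs/assembly",   # assembly produced by `run`
        f"--outprefix={run_id}_pseudohap",
    ]
    return run, mkoutput


# Hypothetical sample ID and path, for illustration only.
run_cmd, mkoutput_cmd = supernova_commands("djap_10x", "/path/to/10x_fastqs")
print(shlex.join(run_cmd))
print(shlex.join(mkoutput_cmd))
```

The FASTA written by `mkoutput` is an ordinary sequence file, so it is the piece that any downstream step using the other libraries would start from.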

ADD REPLY • link 6.1 years ago by GenoMax 147k

0

Yeah, that was our first option. But how do we use the Supernova output together with our other data? Yes, the datasets are from the same exact organism, but they are from different samples.

ADD REPLY • link 6.1 years ago by hirad.alipanah ▴ 10

0

I can recommend ABySS: it is very versatile, makes excellent use of a cluster and performs well, though it might require some parameter tweaking (as with most assembly software). The same developers also provide tools to include the 10x data.
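To make the "parameter tweaking" concrete, here is a minimal sketch that assembles an `abyss-pe` command line with one paired-end and one mate-pair library. It only builds and prints the command. The library names, file names, k-mer size, thread count and Bloom filter size are all hypothetical; in ABySS 2.x setting `B` selects the Bloom filter assembly mode, and suitable values for a genome like this one would have to be found by testing.

```python
import shlex


def abyss_pe_command(name, k, bloom_size, pe_libs, mp_libs, threads=16):
    """Build (not run) an abyss-pe command line.

    pe_libs and mp_libs map a library name to its list of read files.
    All values passed in below are illustrative assumptions.
    """
    args = [
        "abyss-pe",
        f"name={name}",      # prefix for output files
        f"k={k}",            # k-mer size
        f"B={bloom_size}",   # Bloom filter memory budget (ABySS 2.x)
        f"j={threads}",      # number of threads
    ]
    if pe_libs:
        args.append("lib=" + " ".join(pe_libs))   # paired-end library names
    if mp_libs:
        args.append("mp=" + " ".join(mp_libs))    # mate-pair library names
    # One "libname=file1 file2" assignment per library.
    for lib, files in {**pe_libs, **mp_libs}.items():
        args.append(f"{lib}=" + " ".join(files))
    return args


# Hypothetical library and file names, for illustration only.
cmd = abyss_pe_command(
    name="djap",
    k=96,
    bloom_size="100G",
    pe_libs={"pe500": ["pe500_1.fq.gz", "pe500_2.fq.gz"]},
    mp_libs={"mp5k": ["mp5k_1.fq.gz", "mp5k_2.fq.gz"]},
)
print(shlex.join(cmd))
```

Keeping paired-end libraries under `lib` and mate-pair libraries under `mp` mirrors how `abyss-pe` distinguishes libraries used for contig building from those used only for scaffolding.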

ADD REPLY • link 6.1 years ago by lieven.sterck 15k

0

Yes, we've considered that too, but it needs a lot of memory (around 1 TB, and we only have 500 GB). Do you know of other assemblers that require less memory?

ADD REPLY • link 6.1 years ago by hirad.alipanah ▴ 10

0

Perhaps SOAPdenovo is an option? (I have no experience with it myself, though.) MaSuRCA will likely also be too memory-intensive. I think in most cases you will still need to figure out how to include the 10x data, as very few (if any) programs can process all of your data at once.

ADD REPLY • link 6.1 years ago by lieven.sterck 15k
