Any alternative to SSPACE-LongRead, or any suggestions to speed it up?
8.2 years ago
caizexi123 ▴ 60

Hi all,

I am using SSPACE-LongRead to scaffold my genome (around 3 Gb) with error-corrected PacBio data (10,000,000 reads), but it is extremely slow: at the current rate, the run will take about 3 months. Is there an alternative that serves the same purpose? Or are there any suggestions for speeding it up, for example by changing part of the code or some other trick? BTW, I am using the default parameters.

genome Assembly next-gen • 3.3k views

Did you try using more CPU by changing the -T parameter?

Yes, 8 threads. But BLASR takes just a couple of hours to complete, and I think SSPACE-LongRead doesn't run multi-threaded after the BLASR step.

What about PBJelly? Each step in PBJelly's workflow can be run on a cluster.

I am trying PBJelly now and waiting for the result. But what else can I try?

AHA is part of SMRT Analysis, and it seems too complicated to set up. I will try it after PBJelly finishes.

8.2 years ago

Maybe you can try OPERA-LG. It has a script for scaffolding using PacBio reads. Here is the paper: https://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-0951-y

It seems to be better than SSPACE-LongRead.
