Compute resources for running ALLPATHS-lg
6.2 years ago
bio_d ▴ 20

Hi,

I am attempting a de novo assembly of a non-model reptile using ALLPATHS-LG. I have 204 GB of paired-end and mate-pair data.

Since our local computational resources are limited, we will be applying for an allocation (e.g. from XSEDE). Could anyone suggest how many compute hours, in terms of Service Units (SUs), and how much disk space would be needed to complete the assembly?

Thanks in advance.

Assembly sequence • 1.0k views

What is the expected genome size? What is the ratio of paired-end to mate-pair data, and how many mate-pair insert sizes do you have? Do you know whether the genome or sample being sequenced has a high polymorphism rate?

Did you check the Assemblathon paper?


The genome size is approximately 2.6 Gb.

We have 200 bp paired-end libraries and mate-pair libraries of the following sizes: 3 kb, 5.2 kb, 10 kb and 20 kb.

Unfortunately, I have not come across the Assemblathon paper.
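
As a rough sanity check on the numbers above, the sequencing depth can be estimated from the data volume and the genome size. The sketch below is only an approximation: it assumes the 204 GB figure refers to uncompressed FASTQ, where roughly half of the bytes are base calls (the rest being quality scores and headers). The function name and the `bases_per_byte` parameter are hypothetical, chosen for illustration; if the 204 GB are compressed files or already a count of bases, the ratio needs adjusting.

```python
def estimate_coverage(data_bytes, genome_size_bp, bases_per_byte=0.5):
    """Estimate sequencing depth from raw data volume.

    bases_per_byte is an ASSUMPTION: about 0.5 for uncompressed FASTQ,
    since each base is stored alongside a quality character plus headers.
    Use 1.0 if data_bytes is really a count of sequenced bases.
    """
    if genome_size_bp <= 0:
        raise ValueError("genome_size_bp must be positive")
    total_bases = data_bytes * bases_per_byte
    return total_bases / genome_size_bp


# Figures from this thread: 204 GB of reads, ~2.6 Gb genome.
coverage = estimate_coverage(204e9, 2.6e9)
print(f"Estimated coverage: {coverage:.1f}x")
```

Knowing the approximate depth per library type makes it easier to compare the dataset against published assemblies of similar size when estimating memory and run time for the allocation request.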
