Entering edit mode
7.9 years ago
kk.mahsa
▴
150
i have 40x paired end (2*100) reads (genome size is approximately 2G) and I wanna de novo assembled them using ABYSS. i ran ABYSS on 32G RAM computer and it worked for 5 days without any results and i killed running due to some problems. i used ABYSS v 2.0.0. in abyss manual page mentioned that abyss v2.0.0 could handle human genome (76x) on 34G RAM while it can not done my project. why?
ABYSS, could give me good scaffold using paired end reads or it need mate end reads to do that?
thanks
mahasha
can you support any log data about at which phase was the software running? and yes mate pairs or other long reads like SMRT PacBio or nanopore will give you better results
used command was:
abyss-pe k=64 n=5 s=200 name=myscaffolds lib='pe1 pe2' pe1='read1_1.fastq read1_2.fastq' pe2='read2_1.fastq read2_2.fastq'
abyss reaction is:
Reading pe1='read1_1.fastq read1_2.fastq' ...
5 days and only Reading pe1='read1_1.fastq read1_2.fastq' ... !!!
is my command wrong?
add this as comment not an answer;
as you are using
did you tried to use
np=10
(or any number) where this controls number of cores used by the software, also try to usev=-vv
to know exactly what is happeningthanks Medhat, i will use np option and tell you about its result.
kk.mahsa : What hardware config do you have available on this machine?
not related to core usage
Processor: intel Core i7-4770k CPU@3.50GHz×8
CPU(s): 8
Core(s) per socket: 4
socket(s): 1
CPU family: 6
Thread per core: 2
I think you can run 10 normally
That CPU has 4 cores and 8 threads. So using 10 may not be optimal.
What is the size of your data files?
Please use
ADD COMMENT/ADD REPLY
when responding to existing posts to keep threads logically organized.sum of data (reads) is approximately 200G
In absence of any specific error messages only possibility is that the RAM you have is not enough for this amount of data. Do you also have at least 2x swap space allocated? While you did quote from Abyss site that quote must have some finer points that may not have been explicitly mentioned.
Can you take half of the data and see if the program makes progress? Keep an eye on
top
to see what happens to the memory. I am assuming you are the only user on this machine and nothing else is running (that can use memory)?is it possible that i assembled each lane separately and then merge them into one final file? there are any programs to do that?