High CPU of GATK and Picard programs
0
0
Entering edit mode
3.9 years ago
ognjen011 ▴ 290

I understand that this might be difficult to answer without some debugging or trial and error, but I am wondering if some experience, intutition of Java knowledge can help.

I am processing some medium-large unmapped paired end Illumina BAM files (around 50G in size) with the Broad best practice pipeline, but I am using a local cluster 128 cores 528G RAM. I am scheduling tasks using SLURM in such a way that there are no hard limits to CPU or memory.

What sometimes happens is that some Picard metrics tools (which are assigned 7G RAM and unspecified cores) manage to take over 40+ cores or 15+ G RAM (IMAGE). In addition, they seem to take a long time.

Is this known behavior?

I am wondering if I constrain their resources with cgroups, will these tasks fail?

Is it possible that these tasks are somehow performing suboptimally due to their large resource availability?

If needed, I can provide more info. Any help is much appreciated!

Thanks!

TOP command

GATK java cpu RAM • 1.4k views
ADD COMMENT
0
Entering edit mode

Are you explicitly assigning RAM via --mem= option for SLURM? Same with cores (-n N)?

ADD REPLY
0
Entering edit mode

Yes, but I am not enforcing limits as of yet, just using that as scheduling tool. I plan to try that.

I am more interested in a collect metric tool hogging 40 cores. Especially since it is single threaded as far as I can tell.

ADD REPLY
0
Entering edit mode

Are these tools spawning sub-jobs on their own?

ADD REPLY
0
Entering edit mode

No. I use cromwell which submits them for me to SLURM, but SLURM jobs do not spawn other jobs.

ADD REPLY
0
Entering edit mode

Sounds like it's something to do with Java's garbage collection. Set the -XX:ConcGCThreads flag to 1 or 2 or something and see if it helps

ADD REPLY

Login before adding your answer.

Traffic: 1904 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6