Question

How to unzip files in batch?

2

9.5 years ago

flavobacteria ▴ 50

Hi guys, I know this is a basic question: how do I convert .fastq.gz files into .fastq files in batch? For example, if we have 100 .fastq.gz files, what command will decompress all of them to .fastq? Thank you.

next-gen RNA-Seq SNP alignment • 55k views

ADD COMMENT • link updated 21 months ago by Ram 44k • written 9.5 years ago by flavobacteria ▴ 50

3

Why do you need to unzip the fastq files? In most cases it is better to keep them compressed. Most NGS tools can read compressed files directly, and reading a compressed file is often faster than reading an uncompressed one, because less data has to come off the disk.

ADD REPLY • link 9.5 years ago by Giovanni M Dall'Olio 28k

1

Look up a few examples of "xargs" usage on the internet, or simply try gunzip *.fastq.gz
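As a rough illustration of the xargs route, here is a minimal sketch. The scratch directory and sample file names are made up for the demo, and the -0 and -P options assume GNU or BSD xargs rather than a strictly POSIX one.

```shell
#!/usr/bin/env bash
# Demo setup (hypothetical file names): three tiny gzipped FASTQ files in a scratch directory.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
for i in 1 2 3; do
    printf '@read%s\nACGT\n+\nIIII\n' "$i" | gzip > "sample_$i.fastq.gz"
done

# Pass NUL-separated names so unusual file names are handled safely.
# -n 1 runs one gunzip per file; -P 2 runs at most two of them at a time.
printf '%s\0' *.fastq.gz | xargs -0 -n 1 -P 2 gunzip

ls
```

Raising the -P value to roughly the number of available cores is the usual starting point, although disk throughput often becomes the limit before CPU does.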

ADD REPLY • link 9.5 years ago by Ashutosh Pandey 12k

0

If you have LSF (replace bsub with qsub -cwd for SGE):

ls *.fastq.gz | xargs -I {} echo bsub gzip -d {} | sh

ADD REPLY • link updated 22 months ago by Ram 44k • written 9.5 years ago by lh3 33k

Ram · Answer 1 · 2015-05-13

9

9.5 years ago

arnstrm ★ 1.9k

GNU Parallel, FTW!

If you have access to an HPC cluster, open an interactive job session. In my case I have a 64-core node, so I do it as:

qsub -I -l nodes=1:ppn=64 -l walltime=1:00:00

Then run gunzip on all 64 cores:

parallel -j64 "gunzip {}" ::: *.fastq.gz

It'll be done in no time!

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by arnstrm ★ 1.9k

Ram · Answer 2 · 2015-05-13

7

9.5 years ago

venu 7.1k

$ gunzip *.gz

or

$ for f in *.gz; do gunzip "$f"; done
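A variation on the loop, in case you want to keep the compressed originals: a minimal sketch that tests each archive first and then writes the decompressed copy alongside it. The scratch directory and sample file names are invented for the demo.

```shell
#!/usr/bin/env bash
# Demo setup (hypothetical file names): two tiny gzipped FASTQ files in a scratch directory.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
for i in 1 2; do
    printf '@read%s\nACGT\n+\nIIII\n' "$i" | gzip > "sample_$i.fastq.gz"
done

for f in *.fastq.gz; do
    # gzip -t verifies the archive without writing anything.
    if gzip -t "$f"; then
        # -c sends output to stdout, so the .gz file is left in place;
        # ${f%.gz} is the same name with the trailing .gz stripped.
        gzip -dc "$f" > "${f%.gz}"
    else
        echo "skipping corrupt file: $f" >&2
    fi
done

ls
```

This roughly doubles the disk space used, so it only makes sense when the compressed copies are still needed.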

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by venu 7.1k

2

For parallel processing (assuming more cores than files):

for f in *.gz; do gunzip "$f" & done

ADD REPLY • link 9.5 years ago by 5heikki 11k

1

This would start as many jobs as there are files, which is not elegant and can overload the machine. A better way is to use GNU Parallel, as suggested by arnstrm:

parallel --jobs <int cores> gunzip {} ::: *.fastq.gz

ADD REPLY • link updated 22 months ago by Ram 44k • written 9.5 years ago by GouthamAtla 12k

0

Awesome! Worked great on my 480 files

ADD REPLY • link 3.5 years ago by jeremieaauger ▴ 20

0

Isn't this the easiest way? =)

ADD REPLY • link updated 21 months ago by Ram 44k • written 9.5 years ago by dago ★ 2.8k

Ram · Answer 3 · 2015-05-22

1

9.5 years ago

tomc ▴ 90

Or don't.

Pass them compressed if the tool handles it, or decompress on the fly, e.g.:

zcat file.gz | whatever_tool

There are a number of utilities for processing gzip files without explicitly writing out the decompressed file:

http://www.nongnu.org/zutils/manual/zutils_manual.html
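To make the on-the-fly idea concrete, here is a small sketch that counts the reads in a gzipped FASTQ file without writing a decompressed copy. It assumes the common four-lines-per-record FASTQ layout; the scratch directory and sample file are made up for the demo. It uses gzip -dc, which behaves like zcat but also works on systems where zcat expects .Z files.

```shell
#!/usr/bin/env bash
# Demo setup (hypothetical file name): one gzipped FASTQ file holding two reads.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '@r1\nACGT\n+\nIIII\n@r2\nTTGA\n+\nIIII\n' | gzip > sample.fastq.gz

# Stream the decompressed data into wc; nothing is written to disk.
lines=$(gzip -dc sample.fastq.gz | wc -l)

# Assumption: each record is exactly four lines (no wrapped sequences).
reads=$(( lines / 4 ))
echo "sample.fastq.gz: $reads reads"
```

The same pattern works for any tool that reads from standard input: replace wc -l with the tool in question.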

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by tomc ▴ 90

Ram · Answer 4 · 2015-05-22

1

9.5 years ago

Vivek Todur ▴ 60

Hi,

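pigz could be a solution. It is a parallel implementation of gzip that uses multiple threads out of the box and has many parameters for tuning performance. Note that the parallelism mainly benefits compression; decompression of a single file still runs largely on one thread. You can find it here: http://zlib.net/pigz/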

You can simply run pigz -d *.gz to extract the .gz files
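Since pigz is not installed everywhere, a script may want to fall back to gunzip when it is missing. The following is only a sketch of that idea; the scratch directory and sample file names are invented for the demo.

```shell
#!/usr/bin/env bash
# Demo setup (hypothetical file names): two tiny gzipped FASTQ files in a scratch directory.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
for i in 1 2; do
    printf '@read%s\nACGT\n+\nIIII\n' "$i" | gzip > "sample_$i.fastq.gz"
done

# Use pigz when it is on the PATH, otherwise fall back to plain gunzip.
if command -v pigz >/dev/null 2>&1; then
    pigz -d *.fastq.gz
else
    gunzip *.fastq.gz
fi

ls
```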

Thanks

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by Vivek Todur ▴ 60

0

pigz is indeed a very simple way to achieve such a task.

ADD REPLY • link 9.5 years ago by Manu Prestat 4.1k

Ram · Answer 5 · 2015-05-13

0

9.5 years ago

rtliu ★ 2.2k

http://stackoverflow.com/questions/16038087/extract-all-gz-in-a-directory-linux

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by rtliu ★ 2.2k

Ram · Answer 6 · 2015-05-22

0

9.5 years ago

ravi.uhdnis ▴ 220

This might do the job (leave the pattern unquoted so the shell expands it; gunzip does not expand wildcards itself):

gunzip *.gz

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by ravi.uhdnis ▴ 220

Ram · Answer 7 · 2015-05-29

0

9.5 years ago

cvu ▴ 180

gunzip *.fastq.gz will do the job!

ADD COMMENT • link updated 22 months ago by Ram 44k • written 9.5 years ago by cvu ▴ 180