Hi
I wanted to know if it is possible to use clumpify.sh
in a streaming fashion - like one would do with bwa mem .... | samtools
I would like to demultiplex with bcl-convert
and on the fly, I want to use clumpify.sh
.
Thanks in advance.
Hi
I wanted to know if it is possible to use clumpify.sh
in a streaming fashion - like one would do with bwa mem .... | samtools
I would like to demultiplex with bcl-convert
and on the fly, I want to use clumpify.sh
.
Thanks in advance.
I don't think you can do that though @Brian will perhaps provide an official word. Limitation may be with bcl-convert
. bcl-convert
writes output to files (not sure if you can make to write to STDOUT). Then there will be the issue of paired-end data. clumpify.sh
will read from STDIN
(which I tested) but the data may need to be interleaved when streaming since there is only one STDIN.
This is correct. Clumpify will accept stdin and work fine as long as all the data fits in memory, but it doesn't fully stream since it needs to load every single read before it can start deduplicating or sorting.
For example:
bcl-convert | clumpify.sh in=stdin.fq interleaved out=clumped.fq.gz
...that assumes input is interleaved and I don't know whether bcl-convert does that, but it might, since I think all of our files come out of demultiplexing already interleaved...
There's also the issue of multiple output streams... I don't personally run the bcl tools but if you had multiple output streams at once, I don't know how you'd pipe them all. One at a time is fine though.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
tagging: Brian Bushnell