Hi,
We have one fastq file that we would like to split into three smaller fastq files. This could probably be done with the split command (using a line count that is a multiple of 4).
But what we would like to do is create ten sets of triplicates from this one fastq file. So I would like to know if there is a way to split a fastq file randomly while still keeping the four-line structure of each read.
Another way would be to use split on the fastq file, then shuffle the order of the reads and split again. Is there a way to re-order the reads in a fastq file randomly?
Thanks in advance for any idea.
Assa
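For the random split asked about above, here is a minimal Python sketch. It is not the script discussed in the replies that follow, and the names `read_fastq`, `random_split`, `write_fastq` and `make_replicates` are made up for illustration. It assumes every read is exactly four lines and that the whole file fits in memory.

```python
import random
from itertools import islice


def read_fastq(handle):
    """Yield each read as a tuple of its four lines."""
    while True:
        record = tuple(islice(handle, 4))
        if not record:
            return
        if len(record) != 4:
            raise ValueError("truncated FASTQ record")
        yield record


def random_split(records, n_parts=3, seed=None):
    """Shuffle the reads and deal them into n_parts groups of near-equal size."""
    records = list(records)
    random.Random(seed).shuffle(records)
    return [records[i::n_parts] for i in range(n_parts)]


def write_fastq(records, path):
    """Write reads back out, four lines per read."""
    with open(path, "w") as out:
        for record in records:
            out.writelines(record)


def make_replicates(in_path, out_prefix, n_reps=10, n_parts=3):
    """Write n_reps independent random splits of one FASTQ file."""
    with open(in_path) as handle:
        reads = list(read_fastq(handle))
    for rep in range(1, n_reps + 1):
        # A different seed per replicate gives a different shuffle each time.
        parts = random_split(reads, n_parts, seed=rep)
        for i, part in enumerate(parts, 1):
            write_fastq(part, "%s_rep%02d_part%d.fastq" % (out_prefix, rep, i))
```

Calling `make_replicates("reads.fastq", "out")` (both names are placeholders) would write 30 files, three per replicate. Because whole reads are shuffled rather than lines, the four-line structure is preserved.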
Thanks for the script. It seems to work, though I am getting an error after a few minutes.
As the fastq file is gzipped, this is the command I'm using:
After a few minutes I get a chunk size message
But then the script stops without any error message, showing only the traceback:
Is it a memory problem? I hope you can help.
Thanks, Assa
I updated the script just now (to use izip in place of zip). Give it another try.
No, it is still not working. I can run it with the unzipped files, but not with the gzipped ones. I can't understand why.
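The traceback is not shown here, so this is only a guess, but one common reason for a script working on plain files and failing on gzipped ones is that the compressed file is opened with plain `open` instead of `gzip.open`. A minimal sketch of a helper that handles both cases, assuming Python 3 (where `gzip.open` accepts text mode `"rt"`); the name `open_fastq` is made up for illustration:

```python
import gzip


def open_fastq(path):
    """Open a FASTQ file as text, whether or not it is gzip-compressed."""
    # Check the first two bytes instead of trusting the file extension.
    with open(path, "rb") as probe:
        magic = probe.read(2)
    if magic == b"\x1f\x8b":  # gzip magic number
        return gzip.open(path, "rt")
    return open(path, "r")
```

The returned handle can be iterated line by line in the same way for both kinds of file, so the rest of a script does not need to know whether the input was compressed.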