I have transcriptome data in two paired files, R1.fastq and R2.fastq, each 10.8 GB. I compressed both using gzip R1.fastq and gzip R2.fastq, and now the files are 2.2 GB and 2.4 GB. Is it possible for two compressed files to have different sizes when the uncompressed files are the same size?
File sizes should never be used as a quantitative measure of anything. Count the number of reads in both files if you want to be certain.
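If it helps, here is a minimal Python sketch of that read count. It assumes standard four-line FASTQ records (no wrapped sequence lines), and the function name count_fastq_reads is just something made up for illustration. It reads .gz files directly, so you can check the compressed files without decompressing them first.

```python
import gzip
import sys


def count_fastq_reads(path):
    """Count reads in a FASTQ file, assuming exactly 4 lines per record.

    Files ending in .gz are read through gzip; anything else is read as plain text.
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt") as handle:
        n_lines = sum(1 for _ in handle)
    if n_lines % 4 != 0:
        # A line count that is not a multiple of 4 suggests a truncated file
        # or a FASTQ variant this sketch does not handle.
        raise ValueError(f"{path}: {n_lines} lines is not a multiple of 4")
    return n_lines // 4


if __name__ == "__main__":
    # Example: python count_reads.py R1.fastq.gz R2.fastq.gz
    counts = {p: count_fastq_reads(p) for p in sys.argv[1:]}
    for p, n in counts.items():
        print(f"{p}\t{n}")
    if len(set(counts.values())) > 1:
        sys.exit("Read counts differ between files")
```

On the command line, gzip -dc R1.fastq.gz | wc -l gives the line count, and dividing by 4 gives the same number under the same four-line assumption.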
Thanks! I was submitting these pairs to NCBI SRA and wanted to make sure this won't cause any problems.
As you know, I had this problem last time with the SRA file, where the two files were asymmetric. I just wanted to submit the compressed files this time. Yes, wc -l reports the same number of lines for both files.
Upload from a fast wired connection to reduce the chance of corruption or interruption during the upload.