Greetings,
I've downloaded a Short Read Archive (SRA) experiment and dumped it to fastq.
~/tools/sratoolkit.2.4.2-centos_linux64/bin/fastq-dump -I --split-files --gzip SRR1514952/SRR1514952.sra
BWA mem is throwing and error when I'm aligning the mate pairs:
[mem_sam_pe] paired reads have different names: "SRR1514950.1.1", "SRR1514950.1.2"
[mem_sam_pe] paired reads have different names: "SRR1514950.2.1", "SRR1514950.2.2"
[mem_sam_pe] paired reads have different names: "SRR1514950.3.1", "SRR1514950.3.2"
I'm checking that the files aren't truncated and contain the same number of reads. Has anyone run into this problem before?
If you had chosen
-F
option whilefastq-dumping
the reads you would not have had to do this transformation. You will have recovered original Illumina format fastq headers.