Why split fastq files?
1
0
Entering edit mode
3.4 years ago
shpak.max ▴ 50

I've returned to the bioinformatics world after an absence of several years, so the following question may seem naive:

I've inherited scripts from a former colleague for an alignment pipeline. Most of it is straightforward, but I'm unsure about one of the preliminary pre-processing steps.

Namely, there's a perl script that splits the fastq files into subsets of reads set to some input argument value. If the alignments (using bwa) were parallelized, I could see the reason for doing this, but in the absence of parallelization, what is gained computationally by splitting fastq files?

fastq alignments ngs bwa • 824 views
ADD COMMENT
1
Entering edit mode
3.4 years ago
GenoMax 147k

One can align the pieces in parallel (using brute-force parallelization) followed by merging of the BAM files.

ADD COMMENT
0
Entering edit mode

Could you clarify what is meant by "brute-force parallelization" (as opposed to parallelization proper)?

ADD REPLY
1
Entering edit mode

Starting multiple alignment jobs using fastq file pieces. You can then merge the resulting BAM files to produce a final single file.

ADD REPLY

Login before adding your answer.

Traffic: 1800 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6