Splitting paired end bam files
1
1
Entering edit mode
9.5 years ago

Hello all,

Any efficient way to split the bam file to two separate files, say one file with only singletons(mate unmapped) and other with paired reads(no matter if it is discordant).

I have already used.

(i) samtools view -f 8 -F 4 -b foo.bam > foo.singletons.bam

(ii) samtools view -f 1 -F 12 -b foo.bam > foo.paired.bam

Is this correct solution to go ahead.

PS: This file has already been filtered for uniquely mapped reads using NH:i:1 tags.

RNA-Seq rna-seq next-gen • 4.1k views
ADD COMMENT
2
Entering edit mode

if you -f 1 to the second command, you should really -f 9 to the first to be fair :)
Even though its unlikely youll ever come across a bam file with both single reads and paired reads, the first command would match both. The single reads it would match would also be pretty random - not all of them.

ADD REPLY
1
Entering edit mode
9.5 years ago
Dan D 7.4k

That looks good to me. To save space, you might also try outputting to a named pipe:

outputting to a named pipe

ADD COMMENT

Login before adding your answer.

Traffic: 1437 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6