How To Filter Out Sequences With No Hits From Bwa Sam Output?
2
3
Entering edit mode
13.4 years ago

If I run bwa bwasw like this:

~/src/bwa/latest/bwa/bwa bwasw -z 1000 ~/refs/human_g1k_v37.fasta cluster-14263.walks

I get back the positive hits from some of the sequences, but also the empty hits for the sequences that don't map anywhere:

cluster-14263-walk-35   16      4       44630869        0       3S3M3D10M2I6M9D18M1I12M5I22M2I4M26S     *       0       0       CAAGTACAGCTTTTCACTGAGGCATGGAATAAATATCCATTATGGCTCTGAGTGCCCCCAGAATGCACTGCGCACCAGCGCCATGTTTTAGCTCAAGCATAACAACTCCTGACC    *       AS:i:31 XS:i:31 XF:i:1  XE:i:0  XN:i:0
cluster-14263-walk-36   4       *       0       0       *       *       0       0       GGTCAGGAGTTGTTATGCTTGAGCTAAAACATGGCGCTGGTGCGCAGTGCATTCTGGGGGCACTCAGAGCCATAATGGATATTTATTCCATGCCTCAGTGGAAG        *
cluster-14263-walk-37   4       *       0       0       *       *       0       0       GGTCAGGAGTTGTTATGCTTGAGCTAAAACATGGCGCTGGTGCGCAGTGCATTCTGGGGGCAT *

What is the cleanest/recommended way of saving only the hits and not the sequences that have no hit?

Sorry if this has been answered before, I couldn't find an answer other than grepping on the resulting files.

bwa sam filter • 4.1k views
ADD COMMENT
6
Entering edit mode
13.4 years ago

The FLAG field has a bit for testing whether or not a read was "mapped". One can exclude unmapped reads as follows:

samtools view -F 0x4 aln.bam
ADD COMMENT
2
Entering edit mode

Careful, MAPQ == 0 indicates the read could not be mapped unambiguously to the ref genome.

ADD REPLY
0
Entering edit mode

~/bin/bwa bwasw -z 1000 $ref $reads | ~/bin/samtools view -F 0x4 -bt $ref.fai - > file.bam # Perfect! Thanks Aaron!

ADD REPLY
0
Entering edit mode

Good point drio. If you want to do "OR" logic such as this (-F 0x4 || MAPQ != 0), I would recommend bamtools' JSON "rules" files.

ADD REPLY
1
Entering edit mode
13.4 years ago
Docroberson ▴ 310

You can also use Picard tools. http://goo.gl/ndzYU Something like: java -jar picard/directory/ViewSam.jar INPUT=aln.bam ALIGNMENT_STATUS=Unaligned

ADD COMMENT

Login before adding your answer.

Traffic: 2366 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6