Question

Extracting matching reads by read ID

0

Entering edit mode

3.6 years ago

Friederike 9.0k

What tool would you recommend to compare two BAM files and extract matching reads by read ID?

BAM • 2.6k views

ADD COMMENT • link updated 3.6 years ago by GenoMax 152k • written 3.6 years ago by Friederike 9.0k

1

Entering edit mode

Without extracting read names, doing the comparison outside the BAM? filterbyname.sh from BBMap would be an option. You can come up with a clever way of using pipes/process redirection. May post an example later.

ADD REPLY • link 3.6 years ago by GenoMax 152k

0

Entering edit mode

Mostly looking for performance-savvy solutions (and general inspiration if there's not a specific tool that would do it)

ADD REPLY • link 3.6 years ago by Friederike 9.0k

0

Entering edit mode

duplicate:

Extract the alignments from a Bam file by name of the read

Efficiently Extracting Reads With Specific Names ('Queryname') From .Bam File

How To Extract A Subset Of Reads In Fastq Using An Id List?

....

ADD REPLY • link 3.6 years ago by Pierre Lindenbaum 166k

0

Entering edit mode

well, to be fair, I was mostly searching for a clever way to actually compare two BAM files directly, but it seems I'll have to go via extracting the read names first and then use those for subsetting (which is well covered in those posts)

ADD REPLY • link 3.6 years ago by Friederike 9.0k

score 2 · Answer 1 · 2021-11-30

2

Entering edit mode

3.6 years ago

GenoMax 152k

samtools view file1.bam | awk -F "\t" '{print $1}' | sort | uniq  > names_in_file1

filterbyname.sh -Xmx4g in=file2.bam names=names_in_file1 out=file.fq.gz include=t

file.fq.gz will include reads that are common in both files.

ADD COMMENT • link 3.6 years ago by GenoMax 152k

0

Entering edit mode

nice, except that I'd prefer a BAM file in the end, but I think that's an option for filterbyname.sh

ADD REPLY • link 3.6 years ago by Friederike 9.0k

0

Entering edit mode

Correct. You can simply use out=filtered.bam.

ADD REPLY • link 3.6 years ago by GenoMax 152k

score 1 · Answer 2 · 2021-11-30

1

Entering edit mode

3.6 years ago

GenoMax 152k

There is this: https://genome.sph.umich.edu/wiki/BamUtil:_diff

@Pierre also seems to have tool for this: Comparison between .bam files

BAM file comparison

ADD COMMENT • link 3.6 years ago by GenoMax 152k