Is there an existing tool to verify that a FASTQ file and a BAM file possess the same reads?
Is there an existing tool to verify that a FASTQ file and a BAM file possess the same reads?
Use BamHash
designed for this exact operation.
Thank you! Do you happen to know if this tool is meant to take in processed or un-processed bams as input? Also, the result of this tool produces two separate hashes - should both hashes match when comparing to another file? Or is it sufficient for only one of the hashes to match?
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Maybe you can convert from bam to fastq
bedtools bamtofastq
and compare the result with the fastq file usingcomm -12
you have that will gave you the reads that exists in both files. Count them and compare the number with each file that will gave you an idea If they are the same.Are you trying to confirm that the fastq and bam are an exact match, or is spotchecking a few entries good enough? Would agreement in the # of reads be sufficient?