Hi all,
I was wondering if there is a command that can be used to check if all the sequence names in the R1 and R2 fastq.gz files are paired and identical?
I've got some files causing issues where the error during bwa-mem says there are mismatched names. Which is strange because I've used this files months ago without any problems using the same program (only change is I've downloaded them onto a new cluster).
I would like to check if all my files have this issue but I don't see any commands in samtools that can check for this problem. If there is another program that can do this, please advice.
If they worked before and now, after moving them, they are not working, I would be concerned there's a deeper issue. Is the file corrupted/incomplete?
If they're not paired, seqkit pair could help to match up paired-end reads from two fastq files.
repair.sh
from BBMap suite will also re-pair the files.