Hey guys,
I am quite new to this field and recently I have been asked for an exercise in which I get two different files from an NGS machine corresponding to the forward and the reverse sequence into a fastq format. The exercise consists of finding the mate reads in both files.
Should I just match the whole name on the different files using the sequence identifier, i.e, @SEQ_ID? Or is it a faster way to do it in which I do not need to match the complete name and only the run ID possibly (even if I do not know how the sequence ID is going to be given)?