I have inherited an old dataset, but the details of which file corresponds to what have been somewhat lost in the mists of time.
I've managed to decipher most of the details, but there's one niggling issue I wanted to check.
Each folder I've received corresponds to a single sample, and each contains two FASTQ files whose names are identical except for a "1" or a "2". My initial thought was paired-end data, but there was no reason to use PE in this context, the person who collected the data doesn't think it is, and the read-number field in the headers is "1" in both files, so I don't think that's it.
Here are example headers from the fastq:
file 1:
@D00261:443:CBE42ANXX:1:1104:1199:2080 1:N:0:ACAGTG
file 2:
@D00261:443:CBE42ANXX:2:2201:1155:2032 1:N:0:ACAGTG
So if my interpretation is correct, the two files were generated in the same sequencing run (same instrument, run number and flowcell) but on different lanes (1 and 2). Is this typical? I haven't come across it before. Do/did Illumina machines spit out FASTQ files split by lane?
I'm hoping I can assume that, because the run and index are the same, the files are definitely from the same sample.
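For reference, this is roughly how I'm reading those headers. It's only a minimal sketch, assuming the files follow the Casava 1.8+ style Illumina header layout (instrument:run:flowcell:lane:tile:x:y, then read:filtered:control:index). The function and field names are my own, not from any library.

```python
from collections import namedtuple

# Field names are my own labels for the assumed Casava 1.8+ header layout.
FastqHeader = namedtuple(
    "FastqHeader",
    "instrument run flowcell lane tile x y read filtered control index",
)


def parse_header(line):
    """Split one FASTQ header line into its colon-separated fields."""
    name, comment = line.strip()[1:].split(" ", 1)  # drop the leading "@"
    return FastqHeader(*name.split(":"), *comment.split(":"))


file1 = parse_header("@D00261:443:CBE42ANXX:1:1104:1199:2080 1:N:0:ACAGTG")
file2 = parse_header("@D00261:443:CBE42ANXX:2:2201:1155:2032 1:N:0:ACAGTG")

# Same instrument, run number and flowcell means the same sequencing run.
same_run = file1[:3] == file2[:3]
# Different lane numbers would explain the "1" and "2" in the file names.
different_lanes = file1.lane != file2.lane
# In paired-end data the second file would carry read number "2".
both_read_one = file1.read == "1" and file2.read == "1"
same_index = file1.index == file2.index

print("same run:", same_run)
print("lanes:", file1.lane, file2.lane)
print("both read 1:", both_read_one)
print("same index:", same_index)
```

This only looks at the first header of each file. To be thorough, the same check could be run over every record, confirming that each file contains a single lane throughout.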
Any help is very much appreciated.