Hello,
I've been just inspecting my Fastq files a bit and I have a question about the index sequence.
Given a fastq sequence header as follows:
@A00181:639:HNTFMDSX5:2:1101:1018:1000 1:N:0:ACACTAAG+TTATGGAT
I understand that ACACTAAG+TTATGGAT
is a sequence index which differentiates samples on a flow cell. My first question is whether my understanding is right?
If so, wouldn't I expect that all reads in a given sample to have exactly the same sequence for their index? This is mostly the case except for example, I also see different indices here and there in the same fastq file such as ACACAAAG+ATATGGAT
. Why is that the case?
Thanks so much for your help!
This is great! Thanks very much! Each fastq file contains 1 sample and indeed, the difference is in 1 base. Thanks for explaining this very clearly!