I have sequencing data from NextSeq and MiSeq, and the GC content increasing rapidly at 3' end base:
Is there something wrong with library? How did this arise?
I have sequencing data from NextSeq and MiSeq, and the GC content increasing rapidly at 3' end base:
Is there something wrong with library? How did this arise?
You didn't mention which one is NextSeq and which one is MiSeq, but I can guess that above is NextSeq and below is MiSeq.
Why NextSeq produces more G?
It's caused by the two-colour chemistry system. Four bases with two different colors:
As sequencing goes to last cycles, the signal strength decreases. So more bases will be detected as G incorrectly. That's why NextSeq produces more G.
Hi chen, is this a related issue: Can phasing or pre-phase during basecall cause indel? ?
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Do you see this also on miseq? On NextSeq this could mean that you sequenced through the adapter and nothing is left to sequence. It's two colour chemistry, so nothing means G. We see polyG stretches at the end. Can't explain it for miseq though, since AFAIK that still uses 4 colours.
Yes, miseq is the same. Insert sise is about 250, and PE150 is used, as you can see above (maybe not clearly), it is caused by the missing of A rather than ployG, G/C/T all goes up except A.
If the base quality at the 3' end is poor, then do not even consider this finding as valid. Have you produced a FastQC report of the reads? The 3' end of Illumina reads is notorious for being poor quality.
Yes, the quality of last cycle is quite poor, I have posted the quality distribution, do you have any idea what does last cycle do?