I obtained some samples of amplicon sequencing (16s, V4) from a sequencing facility where they have trimmed the adapters & etc, "ready to use". However, I was wondering why, in the fastqc report, I would have the initial drop of quality (5 bps), as it's in the figure:
In your case a "technical explanantion" is likely the same sequence at the beginning of each fragment/amplicon. Illumina sequencers are designed with expectation of an equal distribution of A/C/G/T at any sequence position/cycle. Having low nucleotide diversity (all clusters glowing the same color) affects the Q scores resulting in a drop. This is the reason phiX is recommended as a spike-in with amplicons to normalize the distribution of nucleotides.
Thanks GenoMax! In that scenario, would that mean that, by looking at the overrepresented sequences, it should have the same 5 bp start in all of them right? But that is not what happens...
Thanks GenoMax! In that scenario, would that mean that, by looking at the overrepresented sequences, it should have the same 5 bp start in all of them right? But that is not what happens...
Is there a particular reason you are concerned? Phred score is > Q30 for those bases.
That said 50+% of your reads have the same sequence.
Just being sure that it is nothing wrong with the samples. Thanks!