Hi All,
I am completely new to NGS analysis. I just received data from a paired-end 125 bp Illumina run. I ran both data files through the FastQC software to check for data quality and received a few warnings/failures. In the graph Both runs failed the kmer content test. In the graph, all of the lines hover around zero until you get the right side of the graph (near 94-96) where all of the lines exponentially increase to about 9. Does this mean that there could be adaptors left over on the 5' end? I'm not sure exactly how to interpret this or what I should do about it. Thank you in advance for any advice/suggestions.
Thank you so much for the tips. The per base sequence content plot did issue a warning. The A/T lines overlap at around 30% and the G/C lines overlap at around 20%. I assumed this means that my genome is A/T biased?
Yes, from what you are saying it seems so. You can get a better idea buy looking into the 'Basic Stats' section. It gives you GC% as well.