I am analyzing NGS data for a prokaryotic genome and FastQC initially failed "Per sequence GC content". I then ran Trimmomatic on my Fastq files and after running FastQC on the trimmed paired dataset, the GC content is still not good. could someone tell me what is the reason for this?
The first .jnp shows the result I got before trimming and the second one is for after trimming.
Thank you
I don't know if it's a rich GC organism. I am moving along with the analyses but I just wanna know what are the potential interpretation for this because I'm supposed to comment and each step (with the results) for my assignment.
Thank you
A simple naive explanation can be that you have fragments in your data that contain more GC that most other fragments. That could simply be a by product of creation of libraries. At an extreme it could represent contamination by an unrelated genome.
Thank you so much