Question

Per sequence GC content still not good after trimming

0

Entering edit mode

3.7 years ago

zouini6 • 0

I am analyzing NGS data for a prokaryotic genome and FastQC initially failed "Per sequence GC content". I then ran Trimmomatic on my Fastq files and after running FastQC on the trimmed paired dataset, the GC content is still not good. could someone tell me what is the reason for this?

The first .jnp shows the result I got before trimming and the second one is for after trimming.

Thank you

Per base GC content before trimming Per base GC content after trimming

GC content. trimmomatic. • 2.8k views

ADD COMMENT • link updated 3.7 years ago by GenoMax 151k • written 3.7 years ago by zouini6 • 0

score 1 · Answer 1 · 2021-09-24

1

Entering edit mode

3.7 years ago

GenoMax 151k

Is the organism you are working on GC rich? At this point in process I suggest that you move along with the analysis. If concerns crop up in dowstream analysis then you could revisit this to dig deeper.

There are no absolute failures with FastQC. Their appearence (red X) indicate that your data lies outside the defined bounds (which are for normal genomic sequencing).

ADD COMMENT • link 3.7 years ago by GenoMax 151k

0

Entering edit mode

I don't know if it's a rich GC organism. I am moving along with the analyses but I just wanna know what are the potential interpretation for this because I'm supposed to comment and each step (with the results) for my assignment.

Thank you

ADD REPLY • link 3.7 years ago by zouini6 • 0

0

Entering edit mode

A simple naive explanation can be that you have fragments in your data that contain more GC that most other fragments. That could simply be a by product of creation of libraries. At an extreme it could represent contamination by an unrelated genome.