Bacterial RNA-seq GC%
13 months ago
fluentin44 ▴ 20

Hi,

I'm currently performing RNA-seq on bacterial samples (Pseudomonas aeruginosa) and I'm getting the per-sequence GC content curve below for around 10 of 30 samples. The other samples (not shown here) all peak at around 65%, but in the affected samples there is a bump around the 55% mark whose size differs from sample to sample, and I'm struggling to determine what it is. The samples:

  • Are of great quality (also pictured below)
  • Have been trimmed with fastp, have a minimum length of 75 bp and no apparent adapter contamination (according to FastQC, anyway)
  • Show more sequence duplication at duplication levels above ~500
  • All map to the Pseudomonas aeruginosa reference (a self-assembly by the group I'm doing this work for) at >=97%, including those with the questionable GC content curve

Does anyone know what it could be?

[Image: Good samples GC%]

[Image: Bad samples GC%]

[Image: Good samples sequence dup]

[Image: Bad samples sequence dup]

[Image: Good samples fastp GC%]

[Image: Bad samples fastp GC%]

[Image: Good samples quality]

[Image: Bad samples quality]


Thanks,
Matt

GC-content RNA-seq Bacterial • 581 views
13 months ago
shelkmike ★ 1.4k

It's probably rRNA. The 23S and 16S rRNAs of Pseudomonas aeruginosa have a GC content of about 53% (https://www.ncbi.nlm.nih.gov/nuccore/NR_076166.1, https://www.ncbi.nlm.nih.gov/nuccore/NR_026078.1), well below the ~65% of the rest of your reads. rRNA depletion was probably less efficient in some of your samples than in others, so those samples carry more rRNA reads and show a bigger bump.
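
One rough way to check this is to bin reads by GC% and compare, between good and bad samples, the share of reads that falls near the rRNA GC content. The following is only a minimal sketch, assuming standard 4-line FASTQ records (plain or gzipped). The function names and the 50-58% window are illustrative assumptions, not something taken from FastQC or fastp.

```python
import gzip
import sys
from collections import Counter


def gc_percent(seq):
    """GC content of one sequence as a percentage of its full length."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def read_gc_histogram(fastq_path):
    """Count reads per integer GC% bin in a FASTQ file (plain or .gz)."""
    opener = gzip.open if fastq_path.endswith(".gz") else open
    hist = Counter()
    with opener(fastq_path, "rt") as handle:
        for i, line in enumerate(handle):
            if i % 4 == 1:  # second line of each 4-line record is the sequence
                hist[round(gc_percent(line.strip()))] += 1
    return hist


def fraction_in_window(hist, low, high):
    """Fraction of reads whose GC% bin lies in [low, high]."""
    total = sum(hist.values())
    if total == 0:
        return 0.0
    return sum(n for gc, n in hist.items() if low <= gc <= high) / total


if __name__ == "__main__":
    # Usage: python gc_bump.py sample1.fastq.gz sample2.fastq.gz ...
    # The 50-58% window is an assumed range around the ~53% rRNA GC content.
    for path in sys.argv[1:]:
        hist = read_gc_histogram(path)
        print(path, f"{fraction_in_window(hist, 50, 58):.3f}")
```

If the explanation holds, the samples with the bump should report a noticeably larger fraction in that window. Non-rRNA reads can also fall in this range, so the number is only indicative; counting reads that map to the annotated rRNA genes would be a more direct check.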


That's a very convincing answer, thanks very much!
