Trimmomatic
1
0
Entering edit mode
22 months ago
Monia • 0

enter image description hereFastqc report

After I used fastqc, the final results appeared in this way. My question here is, how do I know if my readings are good enough and ready for the next stage (alignment), and should I make all the marks in the image green, and how do I use trimmomatic to get good results?

I use this bash script

#!/bin/bash

SECONDS=0

cd /home/tc/Project/Reads

# run trimmomatic to trim reads with poor quality
java -jar /usr/share/java/trimmomatic-0.39.jar SE -threads 4 /home/tc/Project/Reads/ERR1880946.fastq.gz /home/tc/Project/trimm/ERR1880946_new.fq.gz ILLUMINACLIP:/usr/share/trimmomatic/TruSeq3-SE.fa 2:30:10 LEADING:3 TRAILING:10 -phred33 SLIDINGWINDOW:4:15 MINLEN:33
echo "Trimmomatic finished running!"

and the result , my script is it right for my data or not?

Trimmomatic RNA quality-check • 1.4k views
ADD COMMENT
0
Entering edit mode

To my knowledge there is no one rule on trimming, it really depends on your requirements and questions. You don't give any additional context, so we cannot see if these parameters are appropriate. For example, are primers still attached, are these paired end, are the pairs meant to overlap?

Firstly, I recommend reading into error rates associated with phred scores, and understanding what each threshold means for your analyses. And whilst I could give you my recommendations here, I instead strongly suggest you first read papers with similar data to your study to see what QC they do, and use similar thresholds. Since these are NCBI accessions, what did the original study do?

ADD REPLY
0
Entering edit mode
22 months ago
jv ★ 1.8k

What kind of sequence library is this data from? bulk RNA-seq? What protocol was used to generate the library? These details are important for know how to interpret your FastQC results.

For now I will assume single-end bulk RNA-seq as that seems the best guess from your trimmomatic command details. A couple of thoughts:

  1. The pre-trimmomatic results aren't all that bad aside from the few cycles in which the quality score drops considerably.
  • I myself have rarely ever used LEADING or TRAILING when running trimmomatic. How about trying only ILLUMINACLIP:/usr/share/trimmomatic/TruSeq3-SE.fa 2:30:10 -phred33 SLIDINGWINDOW:4:15 MINLEN:50. This will mostly clean up the 3' end of the reads while keeping longer reads. Longer reads may map more accurately than shorter reads and thus may negate the drop in quality at cycles 19,20 and 49,50
  1. how do I know if my readings are good enough and ready for the next stage (alignment), and should I make all the marks in the image green

  • no, the quality scores don't need to be completely in the green, particularly at the 3' end of the read where quality "naturally" decreases
  • knowing whether the results are good enough depends on the type of data you have and what analyses you want to do with it
ADD COMMENT
0
Entering edit mode

Thank u for helping me ✨

ADD REPLY

Login before adding your answer.

Traffic: 2331 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6