Hi, I am new to RNA-seq data analysis and am currently trimming my fastq files with trimmomatic, however after trimming my results it seems that some features are getting worse, specifically the sequence length distribution, am I doing something wrong? Could this give me problems in my subsequent analyses?
I am attaching the command line that I am running. I will appreciate any help given.
java -jar trimmomatic-0.39.jar PE -phred33 input_forward.fq.gz input_reverse.fq.gz output_forward_paired.fq.gz output_forward_unpaired.fq.gz output_reverse_paired.fq.gz output_reverse_unpaired.fq.gz ILLUMINACLIP:TruSeq3-PE.fa:2:30:10 SLIDINGWINDOW:4:15 MINLEN:36
I'm confused it clearly got better... You trimmed adapters... from the ends of the reads right?
I remember the times when it was expected the quality of the base to be dropped up to 22 at the end of Illumina reads before preprocessing. What an advance in technologies!
ibq.enriquepola : Please do not delete threads once they have received at least one comment or answer. They provide value to future visitors. You can accept an answer (green check mark) to provide closure to this thread.