Entering edit mode
9 months ago
Bikal
•
0
When I did fastqc analysis on my original fastq files, the encoding that I can see in the biostatistics was sanger/illumina 1.9. But when I did the trimmomatic on those fastq files to trim low quality bases followed by fastqc analysis on the trimmed fastq files the encoding in the biostastics was Illumina 1.5. Why is it so? I made sure that the ASCII characters are not messed up with trimmomatic parameters. However, I am surprised to see the change in the encoding. My only concern is will it affect any subsequent downstream analysis??
Can you show us the trimmomatic command you ran?
I have used the following command for trimmomatic. Note: I am analyzing forward and reverse fastq files separately as they are result of amplification of a gene from several strains of the same mycoparasite collected across different regions. Each of my fastq files has only one read. I am using CROP tool to crop certain regions by trimming off bad quality region from all forward fastq files after looking at the multiqc of the original fastqc files. And similar for reverse fastq files too with only change in CROP parameter. To my surprise fastqc on the trimmed region gave me Illumina 1.5 for some of the samples (not all samples but some of the samples, 30-40% of samples, as in the picture below).
Pic one represents fastqc of a sample before trimmomatic. Second picture represents fastqc of the same sample after trimmomatic.