UPDATE - I could workaround this issue by using the fastp tool as per Istvan's comment below. Im yet to find the root cause of the trimmomatic issue but will update once solved.
Hi all,
I have been using trimmomatic version 0.39 to trim some paired end reads I retrieved off SRA.
The reads have a high adapter content after a check using fastqc.
The reads themselves were prepared with TruSeq RNA Library Pre Kit v2 and sequenced on the Illumina HiSeq 2000, but unfortunately I can't get more information than that.
I have used the following command:
for sample in "${arr[@]}"
do
echo "running trimmomatic for sample $sample"
time java -jar ./Trimmomatic-0.39/trimmomatic-0.39.jar PE -phred33 -threads 24 ${sample}_1.fastq.gz ${sample}_2.fastq.gz \
${sample}_1_paired.fq.gz ${sample}_1_unpaired.fq.gz \
${sample}_2_paired.fq.gz ${sample}_2_unpaired.fq.gz \
ILLUMINACLIP:./Trimmomatic-0.39/adapters/TruSeq3-PE-2.fa:2:30:10 SLIDINGWINDOW:4:15 MINLEN:36
done
The error:
for the forward reads the trimming works fine and the samples come out clean, however there is very little to no adapter content removed from the reverse reads.
an additional piece of information is that fastqc identifies the forward reads as using the 'Illumina univeral adapter' while the reverse reads use the 'Illumina small rna 5' adapter'. I couldn't find any match for the latter adapter in any trimmomatic adapter files or in my troubleshooting. I have tried including the adaptor sequences from here in the adapter fq but it made little difference.
Any information or possible solutions would be greatly appreciated!
did you query the reverse reads with small rna adapter (sequence identified by fastqc)?