Hi all,
I am trying to remove adapters and clean my RNA-seq.gz files using Trimmomatic, loaded on a Linux system (supercomputer server)
Following the steps for Pair ends reads, explained in the manual (http://www.usadellab.org/cms/?page=trimmomatic)
java -jar trimmomatic-0.39.jar PE input_forward.fq.gz input_reverse.fq.gz output_forward_paired.fq.gz output_forward_unpaired.fq.gz output_reverse_paired.fq.gz output_reverse_unpaired.fq.gz ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True LEADING:3 TRAILING:3 MINLEN:36
Since Trimmomatic figured out the argument, I assume Trimoomatic is loaded successfully, before starting the process!
TrimmomaticPE: Started with arguments:
T1R1-f.fastq.gz T1R1-r.fastq.gz T1R1-f_paired.fq.gz T1R1-f_unpaired.fq.gz T1R1-r_paired.fq.gz T1R1-r_unpaired.fq.gz ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True LEADING:3 TRAILING:3 MINLEN:36
java.io.FileNotFoundException: /*****/project/****/md/RNA/rna_data-bacteria/TruSeq3-PE.fa (No such file or directory)
at java.base/java.io.FileInputStream.open0(Native Method)
at java.base/java.io.FileInputStream.open(FileInputStream.java:213)
at java.base/java.io.FileInputStream.<init>(FileInputStream.java:155)
at org.usadellab.trimmomatic.fasta.FastaParser.parse(FastaParser.java:54)
at org.usadellab.trimmomatic.trim.IlluminaClippingTrimmer.loadSequences(IlluminaClippingTrimmer.java:110)
at org.usadellab.trimmomatic.trim.IlluminaClippingTrimmer.makeIlluminaClippingTrimmer(IlluminaClippingTrimmer.java:71)
at org.usadellab.trimmomatic.trim.TrimmerFactory.makeTrimmer(TrimmerFactory.java:32)
at org.usadellab.trimmomatic.Trimmomatic.createTrimmers(Trimmomatic.java:59)
at org.usadellab.trimmomatic.TrimmomaticPE.run(TrimmomaticPE.java:552)
at org.usadellab.trimmomatic.Trimmomatic.main(Trimmomatic.java:80)
Quality encoding detected as phred33
then no progress happens, only empty space.
Thank you in advance
Hi GenoMax
Thank you for your answer,
I have two groups of RNA *.gz files, as fastqc showed, before trimming;
1- with Nextera adapters
2-with Illumina Universal adapters
The example above is for those from group 1.
I uploaded the Nextera-adapters-fa file into the same directory where I have my *.gz files, then start this argument;
Thank you
If the program is running i.e. your system prompt has not returned then let the trimming complete. It could take some time depending on the size of your files. You can open a new terminal (log in to remote server, if needed) and go to this directory. Listing should show growing files with names you chose above. A process monitoring tool like
top
orhtop
should showtrimmomatic
process actively running, consuming CPU cycles.Many thanks, GenoMax, yes it was a matter of time,
GenoMax . However, as I know, supercomputers (servers) are very fast, so is it normal that some bacterial RNA-seq gz.* files , took more than 1 h to be processed? is there any modification on the command line that can accelerate the trming process!
Hi GenoMax,
Sorry for my late reply. Actually it was a problem in the sever itself as I received a notification later.