Hello everyone!
I'm a newbie in analyzes with RNA-Seq.
I have paired-end reads data (Sanger / Illumina 1.9) and would like to cut the reads using quality score.
We have visualized data with FastQC and we tested the softwares: sickle, Spade, fastq quality filter (fastx) and AllPaths.
The best result was obtained with the fastq quality filter, but we lost a lot of reads in the process (it does not cut off part of the reads, only excludes the entire read).
I wonder if there is already a program that recognizes the first occurrence of a symbol (eg '#') and cuts the sequence and the quality from that point.
Did you use fastq-Trimmer instead? I think you can also try running Prinseq using
-trim_qual_left, -trim_qual_right
, with given quality threshold. In your case, you mentioned#
. Quickly looking at the Encoding chart in fastq wiki,#
's score is 2 (assuming 1.8+ and 1.9+ are same). So try using threshold 2, which should remove the regions before and after the threshold value.