I ran MultiQC to check read quality before and after data processing. The MultiQC reports did not show a significant improvement. Why might that be?
First, I trimmed the adapters using BBDuk:
for f in `ls -1 *_1.fastq | sed 's/_1.fastq//'`;
do bbduk.sh -Xmx20g in1=$f\_1.fastq in2=$f\_2.fastq out1=../clean_data/$f\_1.fq out2=../clean_data/$f\_2.fq ref=../adapters.fa ktrim=r k=25 mink=10 ftm=5 tbo tpe;
done
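For reference, the same adapter-trimming loop can be sketched with a shell glob instead of parsing `ls` output. This is only a sketch, assuming bash and the same file layout as above; `sample_prefixes` and `print_bbduk_commands` are hypothetical helper names (not part of BBTools), and the bbduk.sh commands are printed as a dry run rather than executed:

```shell
#!/usr/bin/env bash
# Hypothetical helper: list sample prefixes for files ending in a given
# read-1 suffix, using a glob rather than parsing `ls`.
sample_prefixes() {
    local dir="${1:-.}" suffix="${2:-_1.fastq}" path name
    for path in "$dir"/*"$suffix"; do
        [ -e "$path" ] || continue          # nothing matched: skip the literal pattern
        name="${path##*/}"                  # drop the directory part
        printf '%s\n' "${name%"$suffix"}"   # drop the read-1 suffix
    done
}

# Hypothetical helper: print (not run) one bbduk.sh command per read pair
# found in the current directory, with the same options as the loop above.
print_bbduk_commands() {
    local f
    sample_prefixes . _1.fastq | while IFS= read -r f; do
        printf 'bbduk.sh -Xmx20g in1=%s_1.fastq in2=%s_2.fastq out1=../clean_data/%s_1.fq out2=../clean_data/%s_2.fq ref=../adapters.fa ktrim=r k=25 mink=10 ftm=5 tbo tpe\n' \
            "$f" "$f" "$f" "$f"
    done
}

# Usage (from the directory holding the raw reads):
#   print_bbduk_commands          # inspect the commands
#   print_bbduk_commands | bash   # run them once they look right
```

Printing the commands first makes it easy to confirm that each input pair and output path is what was intended before any data is processed.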
Second, I performed quality trimming:
for f in `ls -1 *_1.fq.gz | sed 's/_1.fq.gz//'`;
do bbduk.sh -Xmx20g in1=$f\_1.fq.gz in2=$f\_2.fq.gz out1=../trimmed_data/$f\_1.fq out2=../trimmed_data/$f\_2.fq qtrim=r trimq=10 maq=10;
done
Third, I performed error correction using Musket:
for f in `ls -1 *.fq.gz | sed 's/.fq.gz//'`;
do ./../../musket-1.1/musket -k 21 536879812 -p 20 -zlib 9 -o ../corrected_data/$f\.fq.gz $f\.fq.gz;
done
for f in `ls -1 *.fq.gz`;