I am a student and am attempting to perform genome assembly using spades.py on SRR10676752. I trimmed the right and left fastq files using trimmomatic. I have tried the following commands:
spades.py -t 32 -m 64 -o results -1 SRR10676752_1.fastq.gz -2 SRR10676752_2.fastq.gz
spades.py -t 32 -m 64 -o results -1 SRR10676752_1_trimmed.fastq.gz -2 SRR10676752_2_trimmed.fastq.gz
spades.py -t 32 -m 64 --carful --phred-offset 33 -o results -1 SRR10676752_1.fastq.gz -2 SRR10676752_2.fastq.gz
spades.py -t 32 -m 64 --carful --phred-offset 64 -o results -1 SRR10676752_1.fastq.gz -2 SRR10676752_2.fastq.gz
All result in same error (255). Results from zcat SRR10676752_1_trimmed.fastq.gz | awk '{s++}END{print s/4}'
and from zcat SRR10676752_2_trimmed.fastq.gz | awk '{s++}END{print s/4}'
do not result in the same count. One is substantially larger than the other for both the trimmed files and untrimmed raw fastq files.
I read on a thread on GitHub to use fastq_pair, however this is a tool that our college server does not have and I am not authorized to install. We were given specific instructions to create pipeline using the tools we have learned thus far, and I cannot thing of any tool we have learned of that could fix this issue.
I am currently in the process of trying a polish of raw data using zcat SRR10676752_1.fastq.gz | head -n 40000000 >Polished_R1.fastq and zcat SRR10676752_1.fastq.gz | head -n 40000000 >Polished_R1.fastq, followed by a gzip of each Polished file and then will try spades again, but this is not the way that we were taught. We are supposed to create a bash script pipeline using this specific SRA ID, which I could do if I had an SRR fastq file to work with that cooperated with spades.py for assembly instead of this one.
Any advice from someone who knows how to handle this type of situation would be great! Only help on how to handle the error with spades, though.
Thanks in advance!
Did you trim the paired-end files independently? If you had trimmed the files together it would have prevented this problem from occurring.