This question is a semi-follow up to a previous question that I posted, but I have more information now and the nature of the question has changed.
I am assembling Illumina reads and using a PacBio assembly as the "-long" input. The PacBio assembly is in a .fa format, not fq, because it is an assembly and does not have quality scores associated with it. When I run the following command in ABySS to assemble all of this information together:
abyss-pe k=51 np=28 j=28 v=-v n=1 in="cinaIT.end1.mcf.fq.gz cinaIT.end2.mcf.fq.gz" long="polished_assembly.fasta" name=ITpb_AB19
The assembly errors out at the bwa step, even when BWA is in my PATH for the assembly. I ran a dry run of the ABySS commands needed to complete the assembly and they are as follows:
bwa mem -a -t24 -S -P -k51 ITpb_AB19-8.fa \
|gzip >polished_assembly.fasta-8.sam.gz
abyss-longseqdist -k51 polished_assembly.fasta-8.sam.gz \
|grep -v "l=" >polished_assembly.fasta-8.dist.dot
abyss-scaffold -v -k51 -s200 -n1 -g ITpb_AB19-8.path.dot ITpb_AB19-8.dot polished_assembly.fasta-8.dist.dot ITpb_AB19-8.path
PathConsensus -v --dot -k51 -p0.9 -s ITpb_AB19-9.fa -g ITpb_AB19-9.dot -o ITpb_AB19-9.path ITpb_AB19-8.fa ITpb_AB19-8.dot ITpb_AB19-8.path
cat ITpb_AB19-8.fa ITpb_AB19-9.fa \
|MergeContigs -v -k51 -o ITpb_AB19-10.fa - ITpb_AB19-9.dot ITpb_AB19-9.path
ln -sf ITpb_AB19-10.fa ITpb_AB19-long-scaffs.fa
PathOverlap --overlap -v -k51 --dot ITpb_AB19-9.dot ITpb_AB19-9.path >ITpb_AB19-10.dot
ln -sf ITpb_AB19-10.dot ITpb_AB19-long-scaffs.dot
abyss-fac ITpb_AB19-unitigs.fa ITpb_AB19-contigs.fa ITpb_AB19-scaffolds.fa ITpb_AB19-long-scaffs.fa |tee ITpb_AB19-stats.tab
ln -sf ITpb_AB19-stats.tab ITpb_AB19-stats
tr '\t' , <ITpb_AB19-stats.tab >ITpb_AB19-stats.csv
abyss-tabtomd ITpb_AB19-stats.tab >ITpb_AB19-stats.md
I've tried running the bwa command directly in the BWA program, but it returns a blank file. I'm wondering if bwa command is right? From what I've gathered from the manual, the bwa mem command asks for two inputs (one .fa and one .fq) and the dry run shows that the assembler is only providing one input file.
Any help on this is greatly appreciated, thank you!
Hi Benv,
Thank you so much for this solution! It worked!