Hi everyone,
I'm trying to run DiscoSNP++ on data from the Tara Oceans Expedition, but I've run into a problem that I'm having trouble solving. The input consists of two very large .gz files (study accession PRJEB4352). The outputs I need are a .fa file and a .vcf file, but neither is produced. This is the script I used for the job, which took about 40 minutes to complete:
#!/usr/bin/env bash
#PBS -l nodes=1:ppn=1
#PBS -l walltime=02:00:00
#PBS -l pmem=8gb
#PBS -A default_project
cd /vsc-hard-mounts/leuven-data/341/vsc34135/Laso-Jadart
source /vsc-hard-mounts/leuven-data/341/vsc34135/miniconda3/etc/profile.d/conda.sh
conda activate discosnp_env
curl -o ERR868369_1.fastq.gz ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR868/ERR868369/ERR868369_1.fastq.gz
curl -o ERR868369_2.fastq.gz ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR868/ERR868369/ERR868369_2.fastq.gz
run_discoSnp++.sh -r fof_1.txt –k 51 -b 1 -T
With fof_1.txt being: ERR868369_1.fastq.gz ERR868369_2.fastq.gz
This is part of the job's error output:
And this is part of the job output:
I'm quite new to writing scripts and working with the VSC, so any help is appreciated.
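One detail worth checking in the command line as posted: the dash in front of `k` appears to be an en dash (`–`) rather than an ASCII hyphen (`-`). If that character is also in the real job script, and was not introduced when pasting here, the shell would pass `–k` as an ordinary argument instead of an option. The snippet below is a minimal sketch for spotting such characters. The function name `find_non_ascii` is hypothetical, and it assumes a `grep` that supports `-a` (GNU and BSD grep both do).

```shell
# Hypothetical helper: print (with line numbers) every line of a file that
# contains a byte which is neither printable ASCII nor whitespace.
# In the C locale, the bytes of a UTF-8 en dash fall into that category.
# -a makes grep treat the file as text even if it looks binary.
find_non_ascii() {
    LC_ALL=C grep -a -n '[^[:print:][:space:]]' "$1"
}

# Demo on a throwaway file holding the command line from the job script.
demo=$(mktemp)
printf '%s\n' 'run_discoSnp++.sh -r fof_1.txt –k 51 -b 1 -T' > "$demo"
find_non_ascii "$demo" || echo "no non-ASCII bytes found"
rm -f "$demo"
```

Running `find_non_ascii` on the actual job script should show whether the en dash is really there. If a line is reported, retyping the dash by hand is the simplest fix.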
Hi Pierre.
Thanks for the fast response. I tried your solution, switched to a smaller dataset, and added some extra "just to be safe" code. It doesn't seem to work, though: I still get the same errors. Here's my tweaked script:
with ofunato_1.txt:
The job takes less than a minute to run, so it must be something in the very beginning that goes wrong.
Hello Paulien
I have no issues with either the master branch or the latest release (2.6.2) on a macOS machine, using exactly your command lines.
Can you try the latest release, https://github.com/GATB/DiscoSnp/releases? You may either install from source or use one of the precompiled versions (macOS or Linux).
Depending on your feedback, we will update the conda package.
I hope this helps.
Hi Pierre,
I tried installing the latest version from source, but I get several error messages.
gives me:
and then it just stops and goes back to the command line. I continued with
Which gives me several errors:
I honestly don't really know how to install DiscoSNP using a precompiled version.
PS: Is it expected that conda doesn't give me the latest version? I get version 1.5.3.