Hi there! This is probably a VERY basic question but I don't have the best terminal skills so I'm struggling a little.
I want to apply what I wrote below for all my fastq scripts without doing a for loop or manually writing the code for each (ideally they all run in parallel, or at least in the same script):
STAR \
--runThreadN 12 \
--genomeDir /scratch/eg5/trial2/align/sequence/STARindex \
--readFilesCommand gunzip -c \
--readFilesIn /scratch/eg51/trial2/trimmed_files/SRR12045658_1_trimmed.fq.gz \
--sjdbGTFfile /scratch/eg51/trial2/align/annotation/genes/GRCm39.ens.gtf \
--outFileNamePrefix /scratch/eg51/trial2/align/results/STARresults \
--limitGenomeGenerateRAM 32000000000 \
--outSAMtype BAM SortedByCoordinate
I tried to use *.fq.gz
, but that did not work and I couldn't specify the outFileNamePrefix as /*
STAR \
--runThreadN 12 \
--genomeDir /scratch/eag519/trial2/align/sequence/STARindex \
--readFilesCommand gunzip -c \
--readFilesIn /scratch/eag519/trial2/trimmed_files/*.fq.gz \
--sjdbGTFfile /scratch/eag519/trial2/align/annotation/genes/GRCm39.ens.gtf \
--outFileNamePrefix /scratch/eag519/trial2/align/results/STARresults/* \
--limitGenomeGenerateRAM 32000000000 \
--outSAMtype BAM SortedByCoordinate
I also tried putting all the files in the same line but that resulted in the following error:
SOLUTION:specify only one or two files in the --readFilesIn option. If file names contain spaces, use quotes: "file name"
STAR \
--runThreadN 12 \
--genomeDir /scratch/eag519/trial2/align/sequence/STARindex \
--readFilesCommand gunzip -c \
--readFilesIn /scratch/eag519/trial2/trimmed_files/SRR12045658_1_trimmed.fq.gz, /scratch/eag519/trial2/trimmed_files/SRR12045661_1_trimmed.fq.gz, /scratch/eag519/trial2/trimmed_files/SRR12045664_1_trimmed.fq.gz, /scratch/eag519/trial2/trimmed_files/SRR12045667_1_trimmed.fq.gz \
--sjdbGTFfile /scratch/eag519/trial2/align/annotation/genes/GRCm39.ens.gtf \
--outFileNamePrefix /scratch/eag519/trial2/align/alignmentresults/STARresults \
--limitGenomeGenerateRAM 32000000000 \
--outBAMsortingThreadN 4 \
--outSAMstrandField intronMotif \
--outFilterIntronMotifs RemoveNoncanonicalUnannotated \
--outSAMattributes All \
--outSAMtype BAM SortedByCoordinate
Anyone have any ideas? i.e., how to enter all the files in , and then have them output with their own file names. Sorry I'm sure this is a very basic question.
Thanks a lot! It didn't work for me for some reason but I'm sure I'll figure it out.
It's a dry run for both parallel and find. Remove
dry-run
if you are using parallel and removeecho
if you are using find command. You need to post what you meant by not working for eg. error logs etc.