for loop with basename created one merged sam file instead of multiple separate sam files
0
0
Entering edit mode
2.7 years ago

Hello!

I wrote code to have my 40 fastq files be aligned to SAM files, but after running this script it ended up created one big SAM file. I've used this same formatting before for other purposes, so I'm wondering why it merged everything?

for i in *_pass.trimmed.fastq; do
  name=$(basename ${i}_pass.trimmed.fastq); bowtie2 \
     -p 8 \
     -x mm10 \
     -U ${name}_pass.trimmed.fastq \
     -S ${name}.sam 
done

One example of a file name is SRR6165747_pass.trimmed.fastq which goes from the ending numbers 47 to 90. I was expecting to have output files SRR6165747.sam up to 97, but in my output folder there was one file named *.sam

Helping me re-format this would be greatly appreciated, thank you!

bash loop bowtie2 • 1.3k views
ADD COMMENT
0
Entering edit mode

I believe you did not declared $.

Try this (not tested):

for i in $(ls *_pass.trimmed.fastq | rev | cut -c 20- | rev | uniq); 

do name=$(basename ${i}_pass.trimmed.fastq); bowtie2 \ -p 8 \ -x mm10 \ -U ${name}_pass.trimmed.fastq \ -S ${name}.sam

done;
ADD REPLY
0
Entering edit mode

This isn't necessary. Bash understands globbing (assuming you have your shell set up correctly), so for i in /*.ext ; do ... is valid.

Almost certainly the basename manipulation you're doing isn't resulting in the filenames you think it is, so bowtie ends up writing to the same .sam. I would have assumed it would overwrite, not append, but perhaps it has some particular behaviour.

I suggest you just debug the loop by replacing your bowtie command with some echo's to make sure you are getting the right filename format. You should also quote your variables: "${var}".

I would start with something like:

for i in *_pass.trimmed.fastq; do
  name=$(basename "${i}"_pass.trimmed.fastq)
  #bowtie2 -p 8 -x mm10 -U "${name}"_pass.trimmed.fastq -S "${name}".sam 
 echo "$name"
done

For me personally I tend not to reach for basename out of habit and use bash operators instead:

for file in /path/to/files* ; do
    command -i "$file" -o "${file%.*}"_suffix.extension
done
ADD REPLY
0
Entering edit mode

Thank you for the reply! I changed my code to write with bash operators, and got it working. One follow up question I had though is that I was not able to specify an output file path, I had to let the output files be made in the same directory. I was wondering if this was due to the way my code was written?

Code that worked:

for file in
/project/nilslind_369/cfausto/errg-hnf1b/input/posttrim-test/*_pass.trimmed.fastq;
do bowtie2 -p 8 -x mm10 -U "$file" -S "${file%.*}".sam done

Code that failed:

for file in
errg-hnf1b/input/posttrim-test/*_pass.trimmed.fastq;
do bowtie2 -p 8 -x mm10 -U "$file" -S errg-hnf1b/output/SAM/"${file%.*}".sam done

I know I can simply move everything afterwards, but was wondering if there's a way I can implement this into the loop?

ADD REPLY
0
Entering edit mode

The "${var%.*}" syntax only strips off the part of the filename after the trailing . of the file extension, essentially (or after whatever you switch . to):

$ var="/path/to/filename.ext1.ext2"
$ echo $var
/path/to/filename.ext1.ext2
$ echo ${var%.*}                                                                                                                                                                
/path/to/filename.ext1

This means it leaves the path structure intact. If you want to obtain just the basename of the file, then this is where basename actually does become the better option, since basename can also take an argument to remove file extensions as well. This can be done with bash operators, but not in a single step (requires intermediate variables):

$ basename "$var" .ext1.ext2
filename

The equivalent operator to do 'left handed' stripping instead of 'right handed' is # instead of %.

Essentially the above code doesn't work because you're adding a path to a path, not to the filename. This is why I suggest you echo your variables first, so you understand what has happened to them.

ADD REPLY

Login before adding your answer.

Traffic: 2068 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6