How to run STAR for multiple fastq files without overwriting?
1
0
Entering edit mode
5.7 years ago
gokberk ▴ 90

Hi all,

I'm new to RNAseq data analysis and want to generate BAM files from 204 fastq files. So, I wrote a very basic for loop to run the same command for all my fastq files. However, since STAR saves every BAM file with the same name, I'm having an overwriting problem. Here is the code that I run:

for ((i=102; i<=305; i++)); do ./STAR --readFilesIn fastq/E08/SRR2930$i.fastq --outSAMunmapped Within --outSAMtype BAM SortedByCoordinate --outSAMmultNmax 1 --genomeDir STARindex --twopassMode Basic --runThreadN 12; done

I tried --outFileNamePrefix option as well, but could not solve the issue. I'd be glad if you could help me with this.

Cheers, Gökberk

STAR • 3.8k views
ADD COMMENT
1
Entering edit mode

(Not exactly answer) You can speed up your command. Your command doesn't mention the "--genomeLoad" method. In that case STAR defaults "NoSharedMemory" i.e loading the index and removing each time you run the command. You can take a look at this post STAR genomeLoad issue.

ADD REPLY
3
Entering edit mode
5.7 years ago

but could not solve the issue.

Please, could you argument the above ?

Did you try something like :

--outFileNamePrefix SRR2930_${i}
ADD COMMENT
0
Entering edit mode

Previously, I tried --outFileNamePrefix $i, but that didn't work for some reason. now I tried --outFileNamePrefix SRR2930$i and it worked fine, thanks a lot!

ADD REPLY
2
Entering edit mode

$i needs to stay connected to the other word for it not to be treated as a different option. You may want to use --outFileNamePrefix SRR2930_${i} so the names are easier to read.

ADD REPLY
0
Entering edit mode

I always forget the brackets but they are good practises indeed

ADD REPLY

Login before adding your answer.

Traffic: 1925 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6