Question

AWS STAR Genome Index Error

0

Entering edit mode

15 months ago

SHXVRR ▴ 20

Hello,

I have been trying to run this line of code for the longest time:

STAR --runThreadN 20 --runMode genomeGenerate --genomeDir genomeDir/ --genomeFastaFiles Homo_sapiens.GRCh38.dna.toplevel.fa --sjdbGTFfile Homo_sapiens.GRCh38.110.chr.gtf

I first tried running it on my home terminal but then realized that that it would take several days so I switched to AWS. I have created an instance for EC2 and then a S2 bucket and then connected it all through IAM. But whenever I try and run it I keep on getting:

Oct 05 03:02:26 ..... started STAR run   
Oct 05 03:02:26 ... starting to generate Genome files        
terminate called after throwing an instance of 'std::bad_alloc'      
  what():  std::bad_alloc
Aborted (core dumped)

I then switched the code to

STAR --runThreadN 20 --runMode genomeGenerate --genomeDir genomeDir/ --genomeFastaFiles Homo_sapiens.GRCh38.dna.toplevel.fa --sjdbGTFfile Homo_sapiens.GRCh38.110.chr.gtf --limitGenomeGenerateRAM 30G --genomeSAsparseD 2

But it still doesn't work. My storage is a max of 30G. Are the files too big and should I be working with smaller ones to create the STAR genome index or is there a better way of dealing with this issue of running a genome index.

Thanks

STAR AWS • 1.0k views

ADD COMMENT • link 15 months ago by SHXVRR ▴ 20

1

Entering edit mode

My storage is a max of 30G

Do you know the difference between memory and disk space? Please find out what your maximum allowed memory is.

ADD REPLY • link 15 months ago by ATpoint 86k

0

Entering edit mode

I didn't mean storage but the instance that I am running the Linux on has a maximum of 30G.

ADD REPLY • link 15 months ago by SHXVRR ▴ 20

score 0 · Answer 1 · 2023-10-05

0

Entering edit mode

15 months ago

GenoMax 148k

See STAR genome generate memory requirements

You are using the wrong genome file.

ADD COMMENT • link 15 months ago by GenoMax 148k

0

Entering edit mode

when I replaced it, I still got the same error:

(base) [ec2-user@ip-172-31-28-142 ~]$ STAR --runThreadN 20 --runMode genomeGenerate --genomeDir genomeDir/ --genomeFastaFiles Homo_sapiens.GRCh38.dna.primary_assembly.fa --sjdbGTFfile Homo_sapiens.GRCh38.110.chr.gtf --limitGenomeGenerateRAM 30G --genomeSAsparseD 2 Oct 05 18:16:00 ..... started STAR run Oct 05 18:16:00 ... starting to generate Genome files terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc Aborted (core dumped) (base) [ec2-user@ip-172-31-28-142 ~]$

ADD REPLY • link 15 months ago by SHXVRR ▴ 20

0

Entering edit mode

30 GB of RAM is not enough. Consider allocating at least 40G. If not you are not going to be able to do this with VM you have.

ADD REPLY • link 15 months ago by GenoMax 148k

0

Entering edit mode

I created a brand new instance and made it have 50G of Storage but still is giving me the same error

ADD REPLY • link 15 months ago by SHXVRR ▴ 20