STAR aligment generating genome index
1
0
Entering edit mode
2.7 years ago

Hi everyone, I am a newbie in RNA-Seq analysis. So I am going to use STAR and as you know there are some parameters when we are running it.

--runThreadN NumberOfThreads
--runMode genomeGenerate
--genomeDir /path/to/genomeDir
--genomeFastaFiles /path/to/genome/fasta1 /path/to/genome/fasta2 ...
--sjdbGTFfile /path/to/annotations.gtf
--sjdbOverhang ReadLength-1

Among those, what should be the number of threads for one fastq file and approximately how much time will it take (I am using a HPC) My organism is human btw Thanks in advance

index genome rna-seq star • 1.3k views
ADD COMMENT
0
Entering edit mode

So, for generating genome index, you're not going to use any fastq file, this will be only for alignment step.

Regarding the number of threads, the best way is to check how many threads you have available. In my case, I like to use Nthreads - 4. But I have a local server available!

ADD REPLY
0
Entering edit mode

Thanks for your help. So in HPC is 28 okay?

ADD REPLY
0
Entering edit mode
2.7 years ago
pmonsieurs • 0

I used the Human genome fasta file as available on http://ftp.ensembl.org/pub/current_fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna_sm.primary_assembly.fa.gz , and GTF file from http://ftp.ensembl.org/pub/current_gtf/homo_sapiens/Homo_sapiens.GRCh38.105.gtf.gz

It took around 25 minutes, also on a HPC using 28 threads. I have to admit that I did not do an optimalization on the number of threads to use (overhead might become the bottleneck when using 28 threads).

ADD COMMENT
0
Entering edit mode

yes I run with the same parameters and mine was took approx 25-26 min. Again thx for the answer :)

ADD REPLY

Login before adding your answer.

Traffic: 2640 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6