I'm attempting to use STAR to index the mouse genome. I'm using the following command:
/opt/conda/envs/STAR/bin/STAR --runMode genomeGenerate --runThreadN 8 --genomeChrBinNbits 12 --limitGenomeGenerateRAM 60000000000 --genomeDir /desktop/output/mouse_genome_index/ --genomeFastaFiles /desktop/mouse_input_data/mouse_gencode_transcripts.fa --sjdbGTFfile /desktop/mouse_input_data/mouse_gencode_annotation.gtf --genomeSAsparseD 3
I downloaded the mouse genome FASTA and GTF files from the GENCODE website : https://www.gencodegenes.org/mouse/ I used the following GTF file
and this fasta file: However, I encountered an error that I'm having trouble understanding:
Fatal INPUT FILE error, no valid exon lines in the GTF file: /desktop/mouse_input_data/mouse_gencode_annotation.gtf Solution: check the formatting of the GTF file. One likely cause is the difference in chromosome naming between GTF and FASTA file.
it is related to the GTF file, but I don't know which GTF file I have to download from gencode in this case ( --sjdbGTFfile )