Ensembl file Mus_musculus.GRCm38.dna.primary_assembly.fa.gz unzipping to incorrect contents
3.1 years ago

Hi All,

I am trying to create an RSEM index, but I am running into an issue with the genome FASTA file. When I download Mus_musculus.GRCm38.dna.primary_assembly.fa.gz from ftp.ensembl, it unzips to Mus_musculus.GRCm38.dna.chromosome.1.fa, and RSEM reports an error that chromosomes are missing. I am using this build because it is the one I used for a previous RNA-seq replicate and I want to be consistent. However, when I download the new build (Mus_musculus.GRCm39.dna.primary_assembly.fa.gz) and unzip it, I get the expected file, Mus_musculus.GRCm39.dna.primary_assembly.fa.

Can anyone tell me what is happening here? Is there another source where I can download the correct Mus_musculus.GRCm38.dna.primary_assembly.fa.gz file? I had already mapped all my reads to the STAR index built from this specific file/build before I noticed the problem during the RSEM step, so I would rather not redo the mapping.

genome RSEM Ensembl • 1.9k views

Can you add a link to the actual file so others can check it?
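One thing that may be worth checking in the meantime (a guess, not a confirmed diagnosis of this particular file): a .gz file can store an original filename in its header, and it can also consist of several gzip members concatenated together. Some unpacking tools name the output after the embedded filename, and some stop after the first member, which would look a lot like "the archive unzips to chromosome 1 only". The sketch below reads the embedded name from the first member's header and lists the sequence names found in the whole file. The function names `gzip_embedded_name` and `fasta_sequence_names` are made up for this example; Python's `gzip` module reads all concatenated members.

```python
import gzip
import struct


def gzip_embedded_name(path):
    """Return the original filename stored in the first gzip member's header, or None."""
    with open(path, "rb") as fh:
        header = fh.read(10)  # magic (2), method (1), flags (1), mtime (4), xfl (1), os (1)
        if len(header) < 10 or header[:2] != b"\x1f\x8b":
            raise ValueError("not a gzip file: " + path)
        flags = header[3]
        if flags & 0x04:  # FEXTRA: skip the optional extra field
            (xlen,) = struct.unpack("<H", fh.read(2))
            fh.read(xlen)
        if not flags & 0x08:  # FNAME bit not set, so no name is stored
            return None
        name = bytearray()
        while True:
            byte = fh.read(1)
            if not byte or byte == b"\x00":  # name is zero-terminated
                break
            name += byte
        return name.decode("latin-1")


def fasta_sequence_names(path):
    """Stream through a gzipped FASTA and return the first word of every '>' header line."""
    names = []
    with gzip.open(path, "rt") as fh:
        for line in fh:
            if line.startswith(">"):
                fields = line[1:].split()
                names.append(fields[0] if fields else "")
    return names
```

Calling `gzip_embedded_name("Mus_musculus.GRCm38.dna.primary_assembly.fa.gz")` and `fasta_sequence_names(...)` on the downloaded file would show whether the header carries the chromosome 1 name and whether the other chromosomes are actually present in the compressed data. If they are, the download itself is probably fine and the problem lies in how it was decompressed.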
