I have downloaded the human reference genome from both UCSC and Ensembl, using the following URLs:
UCSC: http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz
Ensembl: ftp://ftp.ensembl.org/pub/release-99/fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna.toplevel.fa.gz
After decompressing both files with gunzip, I get very different sizes:

Source    Compressed   Uncompressed
UCSC      938 MB       3.1 GB
Ensembl   1.0 GB       62 GB
Is something wrong with one of these files? I want to build an index for alignment, and indexing takes about 1 hour with the UCSC file but about 11 hours with the Ensembl file.
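
One way to narrow this down would be to count the sequences and total bases in each file, since a large difference in the number of records (for example, extra alternate or patch sequences in one file) would show up immediately. The following is only a minimal sketch: the function name summarize_fasta and the local file names in the usage comment are assumptions, not part of either download.

```python
import gzip


def summarize_fasta(path):
    """Return (number of sequences, total bases) for a plain or gzipped FASTA file."""
    # Pick the opener from the file extension (assumes .gz means gzip-compressed).
    opener = gzip.open if path.endswith(".gz") else open
    n_seqs = 0
    n_bases = 0
    with opener(path, "rt") as handle:
        for line in handle:
            if line.startswith(">"):
                # Each header line starts a new sequence record.
                n_seqs += 1
            else:
                # Count sequence characters, ignoring the trailing newline.
                n_bases += len(line.strip())
    return n_seqs, n_bases


# Example usage (hypothetical local file names):
# for name in ("hg38.fa.gz", "Homo_sapiens.GRCh38.dna.toplevel.fa.gz"):
#     seqs, bases = summarize_fasta(name)
#     print(name, seqs, "sequences,", bases, "bases")
```

The file is streamed line by line, so memory use stays small even for the 62 GB file, although reading it will take a while.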
Difference between UCSC and Ensembl genomes
Difference between the Fasta files from UCSC and Gencode/Ensembl