4.4 years ago
bioinfo_ga
Hi, for human analysis: the reference genome (GRCh38) has been updated from patch release p10 to p13, and I noticed a drastic change in genome size. The p10 file was 3 GB, whereas the p13 file is 60 GB. What is the reason?
Thanks.
That is because you are mixing up the "primary" assembly with one that also contains "alternate haplotypes".
Oh, I see. OK, let me look at both files. Thanks for the help.
Where do you want to get your human reference genome from? On NCBI, GRCh38.p10.fna (compressed) is 911 MB and GRCh38.p13.fna is 920 MB.
The differences between each patch and the previous one are listed in README_patch_release.txt:
ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/405/GCA_000001405.28_GRCh38.p13/
From the Ensembl database: http://www.ensembl.org/Homo_sapiens/Info/Index
The Homo_sapiens.GRCh38.dna.toplevel.fa.gz file for GRCh38.p13 is 1.0 GB.
Also, the assemblies in Ensembl releases come from the Genome Reference Consortium.
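If you want to check where the extra size in a file comes from, a rough way is to count the bases per record and split the records into "primary" and "alternate" by their header lines. The following is only a minimal sketch: the function names `open_fasta` and `tally_bases` are made up for this example, and the default `alt_markers` strings are an assumption about how alternate sequences are labelled, so look at the headers in your own file (for example with `grep ">"`) and adjust them.

```python
import gzip


def open_fasta(path):
    """Open a plain or gzip-compressed FASTA file in text mode."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "rt")


def tally_bases(path, alt_markers=("_alt", "alternate locus", "ALT_REF_LOCI")):
    """Count records and bases, split by whether the header looks 'alternate'.

    alt_markers is an ASSUMPTION about header naming; the markers differ
    between providers, so verify them against the real headers.
    """
    totals = {
        "primary_records": 0,
        "primary_bases": 0,
        "alternate_records": 0,
        "alternate_bases": 0,
    }
    markers = [m.lower() for m in alt_markers]
    group = None  # group of the record currently being read
    with open_fasta(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                header = line[1:].lower()
                is_alt = any(m in header for m in markers)
                group = "alternate" if is_alt else "primary"
                totals[group + "_records"] += 1
            elif group is not None:
                # Sequence line: every character counts as one base.
                totals[group + "_bases"] += len(line)
    return totals
```

Calling `tally_bases("my_genome.fa.gz")` (a placeholder file name) returns a dictionary with record and base counts for both groups, which should make it clear whether alternate sequences account for the difference between two downloads.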