Entering edit mode
13 months ago
10mz1
▴
10
Does this mean that the haploid? assembly consists of 705 large molecules? I expected to find a number equal to the number of chromosomes plus the mitochondrial genome.
Not sure where you got the genome from but if you take a look at the headers that should give you an idea of what the (
705 - Normal Chr/MT
) listings are. Most should be unplaced contigs (markedchrUn
), contigs that are unlocalized in a specific spot (>chr22_KI270738v1_random
) etc.https://www.ncbi.nlm.nih.gov/genome/guide/human/
A fasta file is always "haploid" in terms of that regardless of ploidy you always have a single sequence per chromosome. See for a read on other components of the reference genome beyond the "standard" chromosomes: https://gatk.broadinstitute.org/hc/en-us/articles/360041155232-Reference-Genome-Components
There is no such thing as the human genome. There are various human reference genomes (different builds, different versions) and you need to pick the one that fits your application. If there are many ALT contigs included, or it is not a reference genome but an individual assembly, 705 scaffolds might be an accurate number.
You will have to share more info on the genome if you want more specific replies.
The latest human genome 38 reference genome (GRCh38): https://www.ncbi.nlm.nih.gov/genome/guide/human/