Hi. I'm downloading some CRAM files from 1000genomes for use in variant calling. I will convert them to BAM (since most tools can't call from CRAM files). I'm a bit confused about how to go about this process since I assumed a straightforward Samtools based conversion based on a given reference FASTA.
I've already downloaded the hg38 FASTA from Human Genome Resources
You have to download the files referenced by the checksums found on the cram headers. To decompress the cram files, you need exactly the same reference as used for compression. To ensure the correct reference is used, the 1000genomes cram files contain the identity of the reference contigs used for compression, this identity is given by MD5 or SHA1 checksums.