Hi,
I've performed variant calling using the GATK GRCh38 genome reference. The reference is gencode v27 (ensembl version 90 from gtf header), and my understanding is that when using ensembl vep, you should use the same cache version as was used for alignment. I downloaded GRCh38 v90 following instructions from ensembl vep to get the correct cache
curl -O https://ftp.ensembl.org/pub/release-90/variation/indexed_vep_cache/homo_sapiens_vep_90_GRCh38.tar.gz
tar xzf homo_sapiens_vep_90_GRCh38.tar.gz
(The expanded tar then contains homo_sapiens/90_GRCh38)
I then run ensembl vep as such:
vep -i $inputvcf --dir_cache /path/to/90_GRCh38 --cache --cache_version 90 -o $outputvcf
However on the html output from vep it says the annotation source as "Cache: /home/Software/ensembl-vep/Databases/GRCh38_V90_GATK/homo_sapiens/90_GRCh38; homo_sapiens_core_111_38 on ensembldb.ensembl.org"
My VEP version is 111, and I'm having issues re-downloading an older version of ensembl-vep. Is it possible to run backwards compatible caches with a newer version of ensembl vep? It seems to be using v111 from ensembldb.enembl.org despite me setting the version and cache dir.
Any help appreciated, thanks!
Tagging: Louisse_Ensembl