VEP cache trancript versions
2
0
Entering edit mode
4.4 years ago
Chadi Saad ▴ 110

I have a same variant obtained once on GRCH37 and once on GRCH38 I want to annotate it with VEP to obtain the HGVS annotation. So I use VEP (v100) to annotate both variants.

On GrcH37, the variant is annotated with NM_001160184.1, while for GRCH38, this variant is annotated with NM_001160184.2.

It seems that the problem is due to the fact that VEP don't use the same RefSeq database (see : https://m.ensembl.org/info/docs/tools/vep/script/vep_cache.html)

RefSeq version:

2019-06-28 (GCF_000001405.39_GRCh38.p13_genomic.gff) for GRCh38 data 2019-11-01 (GCF_000001405.25_GRCh37.p13_genomic.gff) for GRCh37 data

Why VEP does not use use the same GFF file ? how to obtain the annotation with the same transcript on both genome versions ? (NM_001160184.1 for grch 37 and 38)

Thanks

vep annotation NGS sequencing dna-seq • 1.8k views
ADD COMMENT
1
Entering edit mode
4.4 years ago

Hi Chadi, I'm afraid there's no "same" GFF file for GRCh37 and GRCh38. You can find all RefSeq versions under the following link: https://ftp.ncbi.nlm.nih.gov/genomes/refseq/vertebrate_mammalian/Homo_sapiens/all_assembly_versions/ We import the latest RefSeq version for both builds. Hope that helps. Best wishes, Michal Ensembl Helpdesk

ADD COMMENT
0
Entering edit mode
4.4 years ago
JC 13k

Why VEP does not use use the same GFF file ?

They can't, both reference genomes are not the same, so the annotation and coordinates in some regions are different

ADD COMMENT
0
Entering edit mode

we can't have something like GCF_000001405.39_GRCh37_genomic.gff (the same GCF used for GRCH38 GCF_000001405.39_GRCh38.p13_genomic.gff) ? why using GCF_000001405.25 for GRCh37

ADD REPLY

Login before adding your answer.

Traffic: 1880 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6