an efficient way to annotate mutations from vcf files using multiple annotation files
1
0
Entering edit mode
5.0 years ago
lait ▴ 180

we do whole exome sequencing at our lab since years, and we already have an established bioinformatics pipeline.

Now we are planning to modify/renew our "annotation" step. Two main points should be done:

  1. We are planning to annotate the vcf files using Ensembl VEP.
  2. Additionally, we have several external annotation files, eg: gnomad, LOVD, Cosmic, Clinvar, and other not so known annotations .. ( some are already included in vep cache files)

So, in addition to VEP's annotations, we want to annotate our VCF files with the annotations present in the external annotation files.

I would like to ask, what could be the most efficient way to do this? I know that VEP can accept external annotation files, when having a specific format. Would this be the way to go? or is it more efficient/faster to use external tools (bedtools, vcf tools, ...etc) to match our mutations with the specific annotations in the annotations files.

Also, are there any additional steps that would increase the efficiency of this whole thing? maybe divide the annotations for the external annotations files into a per-chromosome files ?

any input based on your experience in this would be much appreciated.

vep ensembl cadd clinvar lovd • 1.6k views
ADD COMMENT
0
Entering edit mode
4.9 years ago

If I were setting up an automated NGS re-sequencing pipeline again (like I did in 2014), I would retain local copies of all required annotation databases / files that VEP requires. This provides for:

  • version control
  • reproducibility
  • prevents data being transmitted outside your internal network (if that's an issue due to your data being protected by law)

Your other question cannot really be answered by any of us, and will require you to benchmark it in order to determine the most efficient way.

Kevin

ADD COMMENT

Login before adding your answer.

Traffic: 1892 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6