Hi there,
I have a rather large VCF file annotated using VEP, and I am trying to pick the most severe consequence of the variant for each variant in the file. Now I know that VEP automatically arranges the output to be listed as most-severe to least according to this (http://uswest.ensembl.org/info/genome/variation/prediction/predicted_data.html#consequences). However, my VCF doesn't have it in that format (no idea why, but I spot checked, and it doesn't) .
I cannot re-make the VCF using --severe because I didn't create it in the first place. My solution is to write a script to do this manually. But before I do that, I'd be interested if anyone out there has encountered a similar situation and have better suggestions for this.
Thanks for the help!
Thanks Ben!! --pick did what I needed. I ended up doing this:
Run VEP again like so:
vep -i {input} -o {output} --cache --merged --pick --pick_order biotype,ccds,rank,canonical --force_overwrite --vcf
My aim is to filter out variants protein coding regions, and most damaging variants . The default --pick order missed a few because of overlapping transcripts, so I ended up reordering the 'pick' criteria to biotype,ccds,rank,canonical . This worked great! Thanks you for the prompt answer!