Hi,
I have a vcf file (from Mutect2) - list of somatic mutations and their genomic locations & other info.
Does anyone know that there is a R package that can calculate the number of somatic mutations per gene?
For example, map the somatic mutations in the vcf file to any one gene based on their location and count how many mutations in each gene?
I can do it myself manually, but if there's a package that can do that, I can save time.
Thanks..
As far as counting, I don't know of any, but
snpEff
can annotate SNPs, mapping them to genes and qualifying them as "HIGH", "MEDIUM", or "LOW" severity. You can then filter for SNPs that are meaningful to your project.From there you could load the data into R, convert the gene data into a
factor
, then run thesummary
command to count the instances of each gene.Yes, I can annotate the mutations using VEP (https://useast.ensembl.org/info/docs/tools/vep/index.html), and then count them per gene on my own.
But I just wondered if there's a tool to do all of them automatically.
Thanks for your comment anyway.