Question

Annovar gene and cosmic variant annotations don't match

0

Entering edit mode

10.8 years ago

jacobsen.jeremy ▴ 40

I ran GATK variant discovery tools on my bam files and was interested in using the output to see how the variants may be affecting protein production. Additionally I wanted more information on what transcripts the variants can be found in. I formatted the GATK file by removing the header and adding a column for the stop locus. Then I ran annovar with:

perl annotate_variation.pl -geneanno -buildver hg19 gatk_files/... gatk_files/...

The script runs and leaves me with a gene annotated file called ..exonic_variant_function. The problem with the annovar output is that the cosmic variants identifiers don't align with the gene. For instance the first line in the file is:

line262 synonymous SNV NOC2L:NM_015658:exon16:c.C1843T:p.L615L, 1 881627 881627 G A

A cosmic search shows that p.L615L is a variant in KIAA1755 and not in NOC2L. My end goal is to alter the RNA sequence for the transcripts where the variant is found and to code for variant modified proteins in order to build a modified peptide centric database. From what I understand, the cosmic accession number gives the location of the variant from the start site which could make this process easy... if annovar was reporting on the correct variant->transcript.

Any ideas??

Thanks,
Jeremy

RNA-Seq SNP GATK • 3.5k views

ADD COMMENT • link updated 2.4 years ago by Ram 45k • written 10.8 years ago by jacobsen.jeremy ▴ 40

Ram · Accepted Answer · 2014-10-27

1

Entering edit mode

10.8 years ago

karl.stamm 4.1k

so KIAA1755 is on chr20 and NOC2L is on chr1. Your variant is on chr1, so it seems that everything is okay from ANNOVAR.

You mentioned a COSMIC ID# but haven't given one here. pL615L is not an ID#, but a mutation, Leu to Leu. Any protein whose 615th AA is Leu can have a 615L to L mutation. There's probably a lot of them, so you can't just google p.L615L and expect it to refer to the same gene. Your transcript here is NM_015658, whose 615th AA should be Leucine (seen at Uniprot: http://www.uniprot.org/uniprot/Q9Y3T9 ).

KIAA1755 is not involved.

ADD COMMENT • link updated 4.3 years ago by Ram 45k • written 10.8 years ago by karl.stamm 4.1k

0

Entering edit mode

You are absolutely right Karl. I allowed myself to be led astray by the siren calls of Google.

Thanks,
Jeremy

ADD REPLY • link updated 4.3 years ago by Ram 45k • written 10.8 years ago by jacobsen.jeremy ▴ 40