hi everyone
i annotated my vcf file (resulted from samtools) using snpEff and output of snpEff confused me. in html output the "number of SNPs" is 2.5 million while "number of effects" is 3.8 million. this case about indels is higher even, number of indels is 350,256 while "number of effects" is 9 million. what happened? is it normal result? if i want that number of variant and "number of effects" be equal, what should i do?
thanks in advance
Hi ,
What do you mean by number of function ?
it is a part of snpEff results in html format.
enlight us please and answer Titus's question: Please show us an example of such "number of functions". What does it mean ? GO Terms ?
i am so sorry, it is "number of effects" not "number of function". the results is like this:
Number of lines (input file) 2,559,765
Number of variants (before filter) 2,560,952
Number of not variants (i.e. reference equals alternative) 0
Number of variants processed (i.e. after filter and non-variants) 2,560,952
Number of known variants (i.e. non-empty ID) 0 ( 0% )
Number of multi-allelic VCF entries (i.e. more than two alleles) 1,187
Number of effects 3,891,852
If i understand the question is why there is more effect than variant ? isn't it ? The think is you have multi transcripts for an unique gene than you the all transcript concerned by the variant.
yes my question is what you said. there is any way that variants and effects be equal?
If i remember well there is no option for that kind of output. You could do that if you have a list of transcript ( see this page http://snpeff.sourceforge.net/SnpEff_manual.html ). The only condition you need is no overlapping over yours positions transcripts. An other way is to use VEP quit similar to snpEFF ( http://www.ensembl.org/info/docs/tools/vep/index.html ) and which output variant annotation per line for different transcripts.