I have 2 VCF files from 2 different pipelines and I am trying to compare them. To do so I tried 2 things:
- I used vcfeval (for SNVs and indels separately) to get sensitivity and precision, which are quite high for both VCF files.
- I got the number of events that are common between the 2 VCF files (almost 12000) and also the number of unique events for each VCF file. The number of unique events is 2300 for one file and 851 for the other.
Since the same input file was used for both pipelines, and sensitivity and precision are quite high for both, how can I interpret the high number of unique events in these files? Given the high sensitivity and precision, I do not think those unique events are artifacts. How would you interpret such results?
It is not unexpected to get different results when you run different pipelines. To interpret the differences (i.e., the number of unique events), you should probably try to understand how the two pipelines differ. Without knowing which pipelines and thresholds were used in the analysis, I doubt anyone here can provide a more specific interpretation.
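As a possible starting point for digging into the unique events, here is a minimal Python sketch that counts shared and unique calls and splits the unique ones into SNVs and everything else. It is not what vcfeval does: it matches records on an exact (CHROM, POS, REF, ALT) key, so equivalent variants written with different representations (common for indels) will show up as unique, which may by itself explain part of the difference. The function names `variant_keys`, `variant_type` and `compare` are made up for this illustration.

```python
import gzip
from collections import Counter


def open_vcf(path):
    """Open a plain-text or gzip-compressed VCF in text mode."""
    return gzip.open(path, "rt") if str(path).endswith(".gz") else open(path, "rt")


def variant_keys(lines):
    """Collect (chrom, pos, ref, alt) keys from an iterable of VCF lines.

    Multi-allelic records are split so each ALT allele gets its own key.
    Assumption: exact-match keys only, no normalization or left-alignment.
    """
    keys = set()
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header lines and blank lines
        fields = line.rstrip("\n").split("\t")
        chrom, pos, ref, alts = fields[0], int(fields[1]), fields[3], fields[4]
        for alt in alts.split(","):
            if alt not in (".", "*"):  # ignore missing / spanning-deletion alleles
                keys.add((chrom, pos, ref, alt))
    return keys


def variant_type(key):
    """Crude classification: single-base REF and ALT is an SNV."""
    _, _, ref, alt = key
    return "SNV" if len(ref) == 1 and len(alt) == 1 else "indel/other"


def compare(keys_a, keys_b):
    """Count shared calls and break down the unique calls by variant type."""
    return {
        "shared": len(keys_a & keys_b),
        "only_a": Counter(variant_type(k) for k in keys_a - keys_b),
        "only_b": Counter(variant_type(k) for k in keys_b - keys_a),
    }
```

To use it on real files, something like `with open_vcf("pipeline_a.vcf.gz") as fh: keys_a = variant_keys(fh)` (the file name is a placeholder), then `compare(keys_a, keys_b)`. If most unique events turn out to be indels, normalizing both files before comparing is worth trying; if they are mostly SNVs, looking at their QUAL, FILTER and depth values against each pipeline's thresholds is probably more informative.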