How to determine intra-run and inter-run repeatability in statistically sound manner?
Let’s say I have VCF files with germline variants from samples A and B.
Run1: A_rep1, A_rep2, A_rep3, B_rep1, B_rep2, B_rep3,
Run2: A_rep4, A_rep5, A_rep6, B_rep4, B_rep5, B_rep6
I don’t have a “truth” set let’s just assume these are random whole-genome patient non-diseased samples.
For analytical performance I want to determine the variability within the run and between the runs per sample.
Does it make sense for me to compare the full pairwise-matrix table and then average it out for precision/recall etc...
Inter-run example (Run1):
Comparisons
A_rep1 > A_rep2
A_rep1 > A_rep3
A_rep2 > A_rep3
But also, I complete the table
A_rep2 > A_rep1
A_rep3 > A_rep1
A_rep3 > A_rep2
Essentially, since we don’t know which false positives (FP) are real, we need to treat them all equally. Ultimately, this will average out the signal, and then the average precision will equal the average recall.
But I don’t really know if this makes sense or how else to approach the problem given no ground truth.
Other alternatives:
· Take all shared positives across all 3-samples as “the truth” and then use that for each sample. But one bad sample will throw all the rest off (but maybe that’s the point of the measurement). But maybe this conservative approach is more appropriate?
· Jaccard-index
Any thoughts/suggestions from people experienced in this would be greatly appreciated.
Thanks.