Question

probability of two samples sharing a SNP by error

0

Entering edit mode

3.5 years ago

SemiQuant ▴ 100

I'm trying to calculate the probability of the same snp occurring in two different samples by chance. I'm pretty lost here, below are the parameters I have, I'm calling SNPs at ≥1% of the reads in targeted, ultradeep sequence data.

gene_length = 500
error_rate = 0.01
read_depth = 100000

gwas snp illumina • 1.2k views

ADD COMMENT • link updated 23 months ago by Ram 45k • written 3.5 years ago by SemiQuant ▴ 100

0

Entering edit mode

What does by chance mean? Do you think errors show up from totally random cosmic rays, or maybe something intrinsic in the instrumentation. If you used the same instrument and reagents and prep method, then systematic errors will occur in many samples 'better than chance'. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3141275/

ADD REPLY • link 3.5 years ago by karl.stamm 4.1k

0

Entering edit mode

Thanks. By chance, I just meant that we ignore the systematic error and base the calculation on a random error rate of 0.01 I'll have a read of that reference.

ADD REPLY • link 3.5 years ago by SemiQuant ▴ 100

score 0 · Answer 1 · 2021-09-22

0

Entering edit mode

3.5 years ago

SemiQuant ▴ 100

Would this give me the probability of the same error in two samples?

r1 <- error_rate*(read_depth/1000)
(r1*r1)*gene_length
0.05

ADD COMMENT • link 3.5 years ago by SemiQuant ▴ 100