probability of two samples sharing a SNP by error
3.2 years ago
SemiQuant ▴ 80

I'm trying to calculate the probability of the same SNP occurring in two different samples by chance. I'm pretty lost here. Below are the parameters I have; I'm calling SNPs at ≥1% of the reads in targeted, ultradeep sequencing data.

gene_length = 500
error_rate = 0.01
read_depth = 100000
gwas snp illumina • 1.0k views

What does "by chance" mean? Do you think errors arise from totally random events such as cosmic rays, or from something intrinsic to the instrumentation? If you used the same instrument, reagents, and prep method, then systematic errors will occur in many samples more often than chance alone would predict. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3141275/


Thanks. By "by chance", I just meant that we ignore systematic error and base the calculation on a random error rate of 0.01. I'll have a read of that reference.

3.2 years ago
SemiQuant ▴ 80

Would this give me the probability of the same error in two samples?

r1 <- error_rate*(read_depth/1000)
(r1*r1)*gene_length
0.05
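
For comparison, here is a rough sketch of one way the question could be set up as a binomial problem, written in Python rather than R so it needs only the standard library. It is not a vetted method, and it rests on assumptions that are not stated in the thread: errors are independent between reads, sites, and samples; an error is equally likely to produce any of the three alternate bases (`n_alt = 3`); both samples have the same depth; and a SNP is called when at least `call_fraction` of the reads show the same alternate base. The function names `binom_tail` and `shared_false_snp_prob` are made up for illustration.

```python
import math


def binom_tail(n, p, k):
    """P(X >= k) for X ~ Binomial(n, p), with each term computed in log space."""
    if k <= 0:
        return 1.0
    if k > n:
        return 0.0
    log_p, log_q = math.log(p), math.log1p(-p)
    total = 0.0
    for i in range(k, n + 1):
        log_term = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                    + i * log_p + (n - i) * log_q)
        total += math.exp(log_term)
    return min(total, 1.0)  # guard against rounding slightly above 1


def shared_false_snp_prob(gene_length, error_rate, read_depth,
                          call_fraction=0.01, n_alt=3):
    """Probability that at least one identical false SNP is called in both samples.

    Assumption (not from the thread): errors are spread evenly over n_alt
    alternate bases and are independent across reads, sites, and samples.
    """
    per_allele_rate = error_rate / n_alt
    min_reads = math.ceil(call_fraction * read_depth)
    # Chance that one specific alternate base at one site passes the threshold
    p_call = binom_tail(read_depth, per_allele_rate, min_reads)
    # Same site and same base falsely called in two independent samples
    p_shared_one = p_call ** 2
    if p_shared_one >= 1.0:
        return 1.0
    n_candidates = gene_length * n_alt
    # 1 - (1 - p_shared_one)^n_candidates, computed stably for tiny probabilities
    return -math.expm1(n_candidates * math.log1p(-p_shared_one))


gene_length = 500
error_rate = 0.01
read_depth = 100000

print(shared_false_snp_prob(gene_length, error_rate, read_depth))
# If every error at a site is treated as the same variant (n_alt=1):
print(shared_false_snp_prob(gene_length, error_rate, read_depth, n_alt=1))
```

Under these assumptions the result is very sensitive to how the errors are split among alternate bases: with three equally likely alternates the expected error fraction per base sits well below the 1% calling threshold, whereas with `n_alt=1` it sits right on the threshold. Either way the result stays between 0 and 1, unlike a product of rates and lengths.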