If you test, say, 100,000 SNPs on a SNP array for some phenotype, my understanding is that you would correct for 100,000 tests. What about more in-depth studies? Do people correct for all >3 million sites in whole-genome sequencing (i.e. variable nucleotides relative to hg19), presuming there is no epistasis? Or do they correct based on the number of known variants, or some other smaller number?
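For context, here is a minimal sketch of how the per-test p-value cutoff scales with the assumed number of tests under a plain Bonferroni correction (the function name is mine, invented for illustration; the 5e-8 figure is the conventional genome-wide significance threshold, which corresponds to roughly one million independent common variants):

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value cutoff that controls the family-wise
    error rate at alpha across n_tests independent tests."""
    return alpha / n_tests

# 100,000 SNPs on a genotyping array:
array_cutoff = bonferroni_threshold(0.05, 100_000)    # 5e-7

# ~3 million variable sites from whole-genome sequencing:
wgs_cutoff = bonferroni_threshold(0.05, 3_000_000)    # ~1.7e-8

# The conventional GWAS threshold of 5e-8 is equivalent to
# correcting for ~1 million independent common variants:
gwas_cutoff = bonferroni_threshold(0.05, 1_000_000)   # 5e-8
```

In practice the effective number of independent tests is smaller than the raw site count because nearby variants are in linkage disequilibrium, which is part of why a fixed convention like 5e-8 is used rather than correcting for every sequenced base.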
So common variants, or non-damaging variants, are filtered out before the statistics are run?
non-damaging: yes (unless they are rare and in a location where functional characterization isn't obvious, such as non-coding regions)
common: depends
I like to check for variants in the GWAS catalog, but the users that I have interacted with haven't typically been very interested in those results.
A sufficiently large SNP array will probably cover most common variants. Unless you are working with a common disease that, for some reason, hasn't been studied with SNP arrays, I think you can assume the low-hanging fruit has already been identified (although replication of findings from other studies is still worth noting).