I am trying to understand the factors that could lead to erroneous identification of high burden mutation without controlling ethnicity and how it needs to be addressed. What other QC metrics need to be taken care of while studying driver mutations in cancer from "next generation sequencing data"?
Thank you so much, Dan. This exactly what I need to know. Could please elaborate how this burden test is different from GWAS which looks for rare variant with small effect in germline samples?
Burden tests are typically on a gene-wise basis, so looking for genes with a higher proportion of somatic mutations than what you expect based on chance.
I would recommend reading as many papers as possible to see what sorts of QC and methods they are doing. The best way to learn these things is to immerse yourself in the literature and see what the current standards are in the field. Are you working with tumour-only sequencing data?
Thank you, Dan.