I have one question about permutation method that used in TCGA paper (Somatic mutations affect key pathways in lung adenocarcinoma, 2008, Nature) and MutSic method.
In these papers, they applied permutation method for finding co-occurring or mutual exclusive genes. In the supplementary method, they well described about this method but I can't understand the difference between test1 and test2. Does anyone understand the difference?
Concurrence and mutual exclusion analysis We performed two slightly different permutation tests for mutation correlation between genes (i.e. concurrence or exclusion of mutations). In both tests, we take inter-individual differences of gene mutation prevalence into consideration. FDR corrections have been applied.
Test 1: we kept the number of samples mutated in a given gene and the number of genes mutated in a given sample the same as observed in the data and randomly permuted the observed mutations across samples and genes. For each permutation, we recorded the number of samples with concurrent and exclusive mutations (Xc and Xe, repectively) for each pair of genes and compared them with the numbers observed in the original data (Nc and Ne). We repeated this process 10,000,000 times and summarize the frequencies of Xc>=Nc and Xe>=Ne, respectively. These frequencies are used as empirical p-values under the null hypothesis (no correlation between genes).
Test 2: we kept the number of mutations in a given gene and sample the same as observed in the data and randomly permuted the observed mutations across samples and genes. The rest of the analysis is the same as above and we performed 1,000,000 permutations.
Thanks so much :)
Thanks so much Chris :)) Now I clearly understand the difference! Thanks