Entering edit mode
7.3 years ago
akij
▴
190
I was reading some papers on copy number variations. Some of the papers referred to TCGA website from where they downloaded the copy number variation(CNV) data. Most papers mention that on average less than 100 CNVs are found in a cancer cell but when I look at the CNV data of TCGA website almost all of them have more than 100 CNVs in each sample. So why TCGA has such high CNVs in each sample.
how did you look at TCGA cnv data? Was it from GISTIC output ?
Sample data
I believe that the papers that you're mentioning are looking at population based cnv's rather than sample wise. TCGA uses gistic to identify recurrent CN aberration in a cohort, which are potentially driver events.
Above file is a CBS output and not all entries are copy number alterations. Last column
Segment_Mean
shows how much of a genomic segment differs from normal in log2 scale.