Hi
I have a compiled segmentation file (nocnv) for almost 1000 samples. I want to calculate the copy number aberrations for all the samples.
I tried using GISTIC, but I believe it provides only the consensus copy number variations. Is there a better way to calculate copy number aberration from segmentation file.
Here are few lines from the segmentation file I have:
Sample Chromosome Start End Num_Probes Segment_Mean
TCGA-E2-A1IH-01A-11D-A13J-01 1 3208470 42629316 20791 -0.2864
TCGA-E2-A1IH-01A-11D-A13J-01 1 42631290 42692913 60 0.0183
TCGA-E2-A1IH-01A-11D-A13J-01 1 42701180 42918495 113 0.3411
TCGA-E2-A1IH-01A-11D-A13J-01 1 42925163 43153569 131 -0.0548
TCGA-E2-A1IH-01A-11D-A13J-01 1 43153652 43599777 288 -0.3653
TCGA-E2-A1IH-01A-11D-A13J-01 1 43600151 44642193 575 -0.0401
Thanks
Meenakshi
When you say segmentation file, that sounds like it already contains CNV segments. Can you show us some of the file so we know what you need to do?
I added some sample lines from the segmentation file for your review.
Thanks!
Yes, I am trying to calculate the Genome instability index per sample using the CNA from each sample. So, I want to quantify the CNAs using the segmentation file for each sample.
Thanks
This is called data aggregation. Although you can do this in any programming language, I think for you it is easiest to do with Excel pivot tables. Follow some tutorials on how to use pivot tables and then apply to your segmentation results. As Charles warden suggested, you might want to add a column that describes if a segment meets several criteria (length, amplitude) and is therefore considered significant.
Could you explain how you calculated the genome instability using CNA? Thanks