Question

varscan copynumber producing highly variable data

0

Entering edit mode

11.0 years ago

ryan.hartmaier • 0

I ran varscan copynumber on my high coverage, whole genome paired end data (matched normal @ 30x & tumor @ 65x after duplicate removal) using default parameters and followed the workflow on the varscan webpage (copynumber, copycaller, DNAcopy segmentation following the R code sample from Chris). Plotting the output using DNAcopy plot function revealed that the seg means are extremely noisy and over-segmented (looks like a cloud ranging from -2 to 2 on a log2 scale). Oddly enough, the chrUn chromosomes produce much more 'normal' looking plots with a red seg mean line (instead of a cloud) that is mostly around 0. I suspect that maybe the default parameters are not appropriate for high coverage genome-wide data? I'm not sure what to change though. Should I modify --max-segment-size and/or --min-segment-size in varscan copycaller? Any ideas?

varscan copynumber • 3.5k views

ADD COMMENT • link updated 3.7 years ago by Ram 45k • written 11.0 years ago by ryan.hartmaier • 0

Ram · Answer 1 · 2014-07-24

0

Entering edit mode

11.0 years ago

poisonAlien ★ 3.2k

Hi,

Did you normalize your data for differences in library size ( as you mentioned normal 30X and tumor 65X) ?

VarScan2 copynumber has a parameter --data-ratio which accounts for this difference.

ADD COMMENT • link 11.0 years ago by poisonAlien ★ 3.2k

0

Entering edit mode

Yes, sorry I should have included that in my post. I used --data-ratio to adjust for the difference in sequence coverage. All other options were default.

ADD REPLY • link updated 3.7 years ago by Ram 45k • written 11.0 years ago by ryan.hartmaier • 0