Hi, I want to use sciclone on our exome sequencing data. but one thing I can't understand that is how can I got varCount equal to 0? I have no idea about this,
following data i just grep from sciclone-meta-master manuscript figure3 data vaf input:
refCount varCount
261 0
102 0
1462 0
so if that is the case, I totally wrong? I used GATK merged vcf, and use R (data.table) to manipulate and got final input files.
I just wonder if anyone can help me and give me some advice. and would you mind to teach me how to calulate (vcf ./.) the depths for those paired samples only one sample have mutaiton but other one no mutation? bedtools coverage? or other ways?
Often a mutation will only be present in one sample and not the other. This is not unexpected. We typically use bam-readcount to pull VAFs from both bams for all sites.
I just wonder which tools you used? Actually, I use VarScan, Conifer and CNVkit to call CNV, segmented by DNAcopy. but when I got segment CNV should I convert by Bedtools -w 50k (or other num) to make window distance, and re-calulated again? or just used DNAcopy segment CNV as input file for sciclone? before input to sciclone should I remove those CNV changed region or I can give all the info to sciclone, later use copyNumberMargins=0.25 to filter out?
Also I saw you CNV segment value already changed back to real CNV copy number: like (2^0.5)*2, but in R script still use
copyNumberMargins=0.25
Thanks for you help and your tools are really really nice and help our team a lots.
Haitao
Yes, it's real copy number (2 = cn neutral, 3 - amp, etc). 0.25 is the distance away from 2 that is allowed for CN call to be considered neutral
The results are from varscan exome, run through the DNAcopy package for segmentation. It identifies regions of concordant CN, so yes, the lengths will be variable. Any similar CN and segmentation algorithm should be fine.
I have one more question for building clonal evolution, since when I use sciclone to calulate subclone, I already remove those un-neutral region, after got subclone info, can I add those CNV changed region back to build evolution event? I ran THetA2 succesfully, I plan use sciclone + THetA2 to test the evolution. plot by fishplot. Thanks. Haitao
There are options like Theta that allow for merging that information back in, but no supported procedure for merging the two that I know of. You're welcome to invent one, as it's probably possible.
Hi Chris,
I also want to know how to convert the bam-readcount output file to the vaf data required by sciclone. I hope to get your help! thank you very much!
One way is to use something like VATools to annotate your VCF with that information, then extract it by parsing the VCF. https://github.com/griffithlab/VAtools
Dear Miller,
Thanks so much for you reply so quickly, I will try this bam-readcount to pull VAFs from both bam.
One more question: for your CNV segment file, I found those distance of segment windows are so diverse in different chromosome:
I just wonder which tools you used? Actually, I use VarScan, Conifer and CNVkit to call CNV, segmented by DNAcopy. but when I got segment CNV should I convert by Bedtools -w 50k (or other num) to make window distance, and re-calulated again? or just used DNAcopy segment CNV as input file for sciclone? before input to sciclone should I remove those CNV changed region or I can give all the info to sciclone, later use copyNumberMargins=0.25 to filter out?
Also I saw you CNV segment value already changed back to real CNV copy number: like (2^0.5)*2, but in R script still use
Thanks for you help and your tools are really really nice and help our team a lots. Haitao
I will modify my pipeline based on your suggestion. Thanks! Haitao
Dear Chris,
I'm following what you mentioned in this post, I got bam-readcount output file now:
However, I don't know which tool can help me to convert bam-readcount output to like this VAF output:
Thanks in advance, Haitao