sciClone input vaf file?
3
0
Entering edit mode
8.0 years ago
Haitao ▴ 30

Dear All,

Hi, I want to use sciclone on our exome sequencing data. but one thing I can't understand that is how can I got varCount equal to 0? I have no idea about this,

following data i just grep from sciclone-meta-master manuscript figure3 data vaf input:

refCount varCount
261 0
102 0
1462    0

so if that is the case, I totally wrong? I used GATK merged vcf, and use R (data.table) to manipulate and got final input files.

I just wonder if anyone can help me and give me some advice. and would you mind to teach me how to calulate (vcf ./.) the depths for those paired samples only one sample have mutaiton but other one no mutation? bedtools coverage? or other ways?

Thanks so much, Haitao

sciclone exome mutation sequencing • 4.6k views
ADD COMMENT
1
Entering edit mode
8.0 years ago

Often a mutation will only be present in one sample and not the other. This is not unexpected. We typically use bam-readcount to pull VAFs from both bams for all sites.

ADD COMMENT
0
Entering edit mode

Dear Miller,

Thanks so much for you reply so quickly, I will try this bam-readcount to pull VAFs from both bam.

One more question: for your CNV segment file, I found those distance of segment windows are so diverse in different chromosome:

chr1 1 351000
chr1 351001 650000
chr2 1 215000
chr3 1 500000
...

I just wonder which tools you used? Actually, I use VarScan, Conifer and CNVkit to call CNV, segmented by DNAcopy. but when I got segment CNV should I convert by Bedtools -w 50k (or other num) to make window distance, and re-calulated again? or just used DNAcopy segment CNV as input file for sciclone? before input to sciclone should I remove those CNV changed region or I can give all the info to sciclone, later use copyNumberMargins=0.25 to filter out?

Also I saw you CNV segment value already changed back to real CNV copy number: like (2^0.5)*2, but in R script still use

 copyNumberMargins=0.25

Thanks for you help and your tools are really really nice and help our team a lots. Haitao

ADD REPLY
1
Entering edit mode
  • Yes, it's real copy number (2 = cn neutral, 3 - amp, etc). 0.25 is the distance away from 2 that is allowed for CN call to be considered neutral
  • The results are from varscan exome, run through the DNAcopy package for segmentation. It identifies regions of concordant CN, so yes, the lengths will be variable. Any similar CN and segmentation algorithm should be fine.
ADD REPLY
0
Entering edit mode

I will modify my pipeline based on your suggestion. Thanks! Haitao

ADD REPLY
0
Entering edit mode

Dear Chris,

I'm following what you mentioned in this post, I got bam-readcount output file now:

GL456392.1      772     C       123     =:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  A:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  C:103:30.22:36.83:1.46:46:57:0.57:0.06:145.63:46:0.46:64.46:0.33        G:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  T:20:31.10:37.00:0.00:11:9:0.78:0.09:182.55:11:0.59:55.90:0.34  N:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00
GL456378.1      780     T       2       =:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  A:2:33.00:37.00:0.00:1:1:0.32:0.04:129.50:1:0.71:100.00:0.38    C:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  G:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  T:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00  N:0:0.00:0.00:0.00:0:0:0.00:0.00:0.00:0:0.00:0.00:0.00

However, I don't know which tool can help me to convert bam-readcount output to like this VAF output:

chrom POS REF ALT REF_DP ALT_DP TOTAL_DP
GL456378.1 772 C A 103 20 123
GL456378.1 780 T A 0 2 2

Thanks in advance, Haitao

ADD REPLY
0
Entering edit mode
8.0 years ago
Haitao ▴ 30

Hi Chris,

I have one more question for building clonal evolution, since when I use sciclone to calulate subclone, I already remove those un-neutral region, after got subclone info, can I add those CNV changed region back to build evolution event? I ran THetA2 succesfully, I plan use sciclone + THetA2 to test the evolution. plot by fishplot. Thanks. Haitao

ADD COMMENT
0
Entering edit mode

There are options like Theta that allow for merging that information back in, but no supported procedure for merging the two that I know of. You're welcome to invent one, as it's probably possible.

ADD REPLY
0
Entering edit mode
3.0 years ago
2891913164 • 0

Hi Chris, I also want to know how to convert the bam-readcount output file to the vaf data required by sciclone. I hope to get your help! thank you very much!

ADD COMMENT
0
Entering edit mode

One way is to use something like VATools to annotate your VCF with that information, then extract it by parsing the VCF. https://github.com/griffithlab/VAtools

ADD REPLY

Login before adding your answer.

Traffic: 1389 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6