Hi, I am newbie in CNVKit
I want to analyse clinical exome panel sample in CNVkit to detect copy number variants. When I tried with bed file that include 113k row, I get 60k targeted CNV. When I filter the result according to log2 ratio and morbid genes, I get 20k result.
I confused about the number of CNV. I don't believe the trustability of my results. Can anyone say something about it?
Try following the
batch
command from CNVkit pipeline. Another good ideia is to also runExomeDepth
and look into intersections of both callers!Pretty impossible for anyone to comment without additional details, including the code you ran and plots of CN ratio for your segments across the genome.
Is it germline analysis or somatic? I think CNVkit is the best for Somatic CNVs calling. For germline it may be better to use something like ExomeDepth (or ClinCNV).
It is germline.
I applied the standard CNVkit pipeline. My codes:
Target
cnvkit.py target hg38_file.bed --split -o my_targets.bed
Anti-target
cnvkit.py antitarget my_targets.bed -g access-5kb.hg10.bed -o my_antitargets.bed
Coverage
cnvkit.py coverage my_targets.target.bed -o Sample.targetcoverage.cnn cnvkit.py coverage Sample.bam my_targets.bed -o Sample.antitargetcoverage.cnn
Reference
cnvkit.py reference -o Reference.cnn -f Homo_sapiens_assembly38.fasta -t Sample.targetcoverage.cnn -a Sample.antitargetcoverage.cnn
(Although I also tried reference cnn that I create from multiple sample, I get 15-20k result again)
fix
cnvkit.py fix Sample.targetcoverage.cnn Sample.antitargetcoverage.cnn Reference.cnn -o Sample.cnr
Sample cnr has approximatelly 20k result...is this problem?