Entering edit mode
6.2 years ago
yy1036832160
•
0
Hello,how to use CNVKit’s .cnn and .cnr files as input file for PureCN to estimate purity? Can you give an example? Thank you
Hello, I want to use results from gatk pipeline, do you have any ideas how to do in
PureCN
? I see no example inPureCN
overview and quickstart. Thanks.It currently does not take all GATK4 output files out-of-the-box, but you can provide the segmentation and copy number log-ratios (see Section 10.1). PureCN can read GATK4 coverage files (in hdf5 format). Simply provide the tumor coverage and PureCN will be able to map provided log-ratios to the genomic coordinates (no need to generate and provide an interval.file).
Btw, PureCN implements the GATK4 coverage normalization with added support for sex chromosomes and off-target regions. There are differences in GC-normalization and segmentation though.
Thanks @markus.riester. I tried to run a small test but some error raised.
My code
Error message
My coverage file
*.hdf5
comes from gatk CollectReadCounts . Error message says.readCoverageGatk3
does this command suits gatk4 hdf5 format?My interval file is picard style from gatk PreprocessIntervals not from
PureCN
.When you calculate the log-ratio like that, you are not using GATK4's denoising steps. GATK4 should generate a file with log2-ratio, you want to parse that file and provide the corresponding log2-ratio like you did.
The issue is likely the wrong format of the interval file. Just run it with:
If you want to use GATK4's segmentation, provide it via seg.file (you might need to change the format for now, but I will add support for that soon, see the main vignette).
PureCN version 1.15.4 now added support for the GATK4 copy number workflow.