Entering edit mode
7 months ago
Sd
•
0
I am using the following tabix command to slice a GVCF file, but it is not creating sliced GVCF files. I have a bed file for the hg38 genome reference with 1MB regions and 1kb padding, and I want to slice using the bed file. I was wondering if you could help me with how to do it?
./tabix -R regions.bed -p vcf sample.g.vcf.gz > sample_sliced.gz
I want to run gatk
GenomicsDBImport
tool to build a GenomicsDB. I want to slice it to reduce the computation time since I have 1300 gvcf files. What is your suggestion to runGenomicsDBImport
efficiently?call the GVCF by slices eg: chr1:1000000-2000000
run GenomicsDBImport for each sample for one interval
genotype gvcf for one interval
at the end concatenate all the VCFs
Then what is the best way to do slicing for 1Mb windows? Doing
--intervals
in the GenomicsDBImport?No, in haplotypecaller
I already have gvcfs called by chromosome and there are 24 gvcfs per sample which I merged them into a single gvcf per sample. Now I want to slice them to smaller chunks.