Syntax for ContEst program to identify sample contamination
0
0
Entering edit mode
8.5 years ago
MAPK ★ 2.1k

I have been running this tool from GATK ( https://www.broadinstitute.org/cancer/cga/contest_run ) as this following command below where snps.raw.vcf is the VCF file for common SNPs,bam_156samples.list is the list of bam files (full path), sample.txt is the list of samples to be used from VCF file, and hapmap_3.3.hg19.vcf is the population data. The output (all_samples_test2_contaminated.txt) I get from this command is empty with the header line only. Can someone please tell me what is wrong with the command below?

command:

java -jar ContEst.jar -T Contamination -B:genotypes,vcf /mypath/snps.raw.vcf -BTI genotypes -B:pop,vcf /mypath/hapmap_3.3.hg19.vcf -I /mypath/bam_156samples.list -R /mypath/hg19.fasta -pc 0.1 -o all_samples_test2_contaminated.txt -sn sample.txt

all_samples_test2_contaminated.txt output file contains only the header line:

name    population  population_fit  contamination   confidence_interval_95_width    confidence_interval_95_low  confidence_interval_95_high sites
ContEst • 2.0k views
ADD COMMENT

Login before adding your answer.

Traffic: 1562 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6