Hi all,
When I used GATK UnifiedGenotyper to call variants from three Neanderthal BAM files, there was an error message for the calling,
$ java -jar GenomeAnalysisTK.jar -T UnifiedGenotyper -R human_ucsc_hg19.fasta -I SLVi33.16.hg19.sorted.bam -I SLVi33.25.hg19.sorted.bam -I SLVi33.26.hg19.sorted.bam -o snps.calling.vcf
##### ERROR ------------------------------------------------------------------------------------------
##### ERROR A USER ERROR has occurred (version 1.2-2-g8143def):
##### ERROR The invalid arguments or inputs must be corrected before the GATK can proceed
##### ERROR Please do not post this error to the GATK forum
##### ERROR
##### ERROR See the documentation (rerun with -h) for this tool to view allowable command-line arguments.
##### ERROR Visit our wiki for extensive documentation http://www.broadinstitute.org/gsa/wiki
##### ERROR Visit our forum to view answers to commonly asked questions http://getsatisfaction.com/gsa
##### ERROR
##### ERROR MESSAGE: SAM/BAM file SAMFileReader{SLVi33.16.hg19.sorted.bam} is malformed: Error: the number of base qualities does not match the number of bases in M_SOLEXA-GA04_JK_PE_SL19_repeat:6:56:1719:1324.
##### ERROR ------------------------------------------------------------------------------------------
$ samtools view SLVi33.16.hg19.sorted.bam | grep "M_SOLEXA-GA04_JK_PE_SL19_repeat:6:56:1719:1324"
M_SOLEXA-GA04_JK_PE_SL19_repeat:6:56:1719:1324 0 chr1 74071 0 53M * 0 0 CACCTATGAGTGAGAATATGCGGTGTTTGGTTTTTTGTTCTTGCGATAGTTTA * RG:Z:SLVi33.16 UQ:i:0
So, my question is, how can I fix this problem? Meanwhile, does anyone call the Neanderthal SNPs from BAM files downloaded from UCSC, ftp://hgdownload.cse.ucsc.edu/gbdb/hg19/neandertal/seqAlis/SLVi33.16.hg19.bam ftp://hgdownload.cse.ucsc.edu/gbdb/hg19/neandertal/seqAlis/SLVi33.25.hg19.bam ftp://hgdownload.cse.ucsc.edu/gbdb/hg19/neandertal/seqAlis/SLVi33.26.hg19.bam
However, maybe someone has the better protocol to call these Neanderthal SNPs. Thanks in advance.
does any line of your bam file contain qualities at all?
Yes, most of the lines in the BAM file contain qualities for each bases at the line.