Entering edit mode
5.7 years ago
kennyworkman
•
0
Hi, I'm new at this and trying to construct a sequence validation pipeline.
I'm using this command to call variants in my script:
bcftools mpileup -f "$1" "$2" | bcftools call -mv -Ob -o temp_variants.bcf
When calling bam files with a double deletion, it is able to detect the variant. However, when there is only a single deletion in the bam file, the variant file is empty?
Any ideas/help would be much appreciated!
not clear, show us the VCF lines and the reads.
Sorry, here's the VCF lines from a file with a double deletion:
Here's the VCF file from a bam file with a single deletion (empty):
And here's the read that I'm referencing:
It has a depth of 5, so I'm assuming there is some sort of variant call here?
Still not clear. That's not a read, that another line of a vcf file.
What is considered a "read"?
Kenny, that entry that you pasted was a single nucleotide variant record - these are stored per row in the VCF/BCF file. The aligned reads are stored in your BAM file.
The fact that this deletion is in a homopolymer of T bases is worrying - these are usually false positive calls, particulary at read depths as low as 5. Are you sure that the call is genuine, i.e., was it confirmed by a companion in vitro method?
To interpret that
<*>
, take a look here: What does <*> mean in a vcf file?Hello kennyworkman ,
what are the values for
"$1"
and"$2"
?fin swimmer
Yeah, that's not clear. Just whatever .bam file and reference fasta file I pass into the script.