8.3 years ago
ehzed
Hello,
I am running one of the most basic commands, found on the lumpy webpage:
lumpyexpress -B my.bam -S my.splitters.bam -D my.discordants.bam -o output.vcf
to call copy number variants.
One of the warnings that I received was "Warning: only 0 elements in distribution (min:1000)". I dug around and found the code that emits it. The only problem is that I am a beginner in Python, and I am having trouble identifying what the elements are, other than that they are stored in the L array. I was wondering whether anyone has experienced this problem before, what the "elements" are, and whether this will affect my results. I am still getting a growing VCF file as output as we speak. Thanks!
pairend_distro.py needs at least 1000 alignments to determine the stats that lumpy uses to id breakpoints. These alignments must:
have the flags "read paired", "mate reverse strand", "first in pair" set
not have the flags "read unmapped", "mate unmapped", "read reverse strand", "second in pair", "not primary alignment", "read is PCR or optical duplicate", "supplementary alignment"
have an insert size >= 0
have the chrom of both the alignment and mate match
In your case, none of the first 1000000 alignments met all of these criteria.
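To see how many of your own alignments would qualify, the criteria above can be sketched as a small Python filter over SAM text lines. This is only an illustration of the listed conditions, not the actual pairend_distro.py code: the function names `passes_filter` and `count_eligible` are made up for this example, and I am assuming the "insert size" is the TLEN column (field 9) of the SAM record.

```python
# SAM flag bits, as defined in the SAM specification.
PAIRED = 0x1
UNMAPPED = 0x4
MATE_UNMAPPED = 0x8
REVERSE = 0x10
MATE_REVERSE = 0x20
FIRST_IN_PAIR = 0x40
SECOND_IN_PAIR = 0x80
SECONDARY = 0x100
DUPLICATE = 0x400
SUPPLEMENTARY = 0x800

# Flags that must be set, and flags that must not be set.
REQUIRED = PAIRED | MATE_REVERSE | FIRST_IN_PAIR
FORBIDDEN = (UNMAPPED | MATE_UNMAPPED | REVERSE | SECOND_IN_PAIR
             | SECONDARY | DUPLICATE | SUPPLEMENTARY)


def passes_filter(sam_line):
    """Return True if one SAM alignment line meets all listed criteria.

    Hypothetical helper; assumes insert size means the TLEN column.
    """
    fields = sam_line.rstrip("\n").split("\t")
    flag = int(fields[1])
    rname, rnext = fields[2], fields[6]
    tlen = int(fields[8])
    # RNEXT is "=" when the mate is on the same reference as the read.
    same_chrom = rnext == "=" or rnext == rname
    return ((flag & REQUIRED) == REQUIRED
            and (flag & FORBIDDEN) == 0
            and tlen >= 0
            and same_chrom)


def count_eligible(sam_lines):
    """Count alignments that pass, skipping '@' header lines."""
    return sum(1 for line in sam_lines
               if not line.startswith("@") and passes_filter(line))
```

You could feed it the output of `samtools view my.bam` (for example by passing `sys.stdin` to `count_eligible`). If the count stays at 0, looking at which flag values actually occur in the file should show which of the conditions is failing.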
What did you use to align your data?
How did you align your bam?
Maybe you have single-end data and it expects paired-end data?
Hi trausch, I have paired-end data.