Varscan Error On Mpileup Input
2
0
Entering edit mode
12.0 years ago
GPR ▴ 390

Hello, I am trying to filter low quality data off my mpileup files using Varscan but keep getting stuck with the error message: Invalid format for pileup at line 1. I don't think my vcf tools are corrupted, since they have made it through bedtools instersect.

The command lines I have used are the following:

samtools mpileup -uf genome.fa 1.bam 2.bam | bcftools view -bvcg -> *.bcf

I also used -vcg -> *.vcf and even ommitted -u. For VarScan I am typing:

java -jar VarScan.jar mpielup *.vcf  --output-vcf

Any tip to as why is my vcf file not recognized by VarScan will be appreciated.

varscan • 6.7k views
ADD COMMENT
1
Entering edit mode
12.0 years ago
Irsan ★ 7.8k

Do you want to convert the samtools mpileup format to variant calls and filter them for variant allele frequency, #reads supporting variant/reference, etc? Then the command you should use is

samtools mpileup -uf genome.fa 1.bam 2.bam | java -jar /path/to/VarScan.jar mpileup2snp --output-vcf > your_sample.vcf

So not

java -jar VarScan.jar mpielup *.vcf --output-vcf

So, it expects a .pileup-file, not vcf

ADD COMMENT
0
Entering edit mode

Thanks, I went ahead with your recommendation, but I get the same error: invalid format at line 1. My bam files have a header, are sorted, and the duplicates removed.I also tried adding Read Groups, which are needed to get GATK working. Any other suggestions?

ADD REPLY
0
Entering edit mode

The job actually went through when taking -u off in samtools mpileup . Thanks again!

ADD REPLY
0
Entering edit mode

Nice you have sorted it out!

ADD REPLY
1
Entering edit mode
12.0 years ago
dankoboldt ▴ 140

Irsan, thanks for helping out!

ADD COMMENT

Login before adding your answer.

Traffic: 2701 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6