Greetings, I've seen related threads, but none that seem to point out what I'm overlooking here.
I am trying to bgzip and then tabix index a small VCF file:
##fileformat=VCFv4.1
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT GS000016660-ASM
1 1413232 . CGC TGT . . NS=1;AN=2 GT:PS:FT:GQ:HQ:EHQ:CGA_CEHQ:GL:CGA_CEGL:DP:AD:CGA_RDP 1|0:1413210:PASS:51:166,51:163,47:29,20:-166,0,-51:-29,0,-20:20:10,10:10
1 1469598 . CG GC . . NS=1;AN=2 GT:PS:FT:GQ:HQ:EHQ:CGA_CEHQ:GL:CGA_CEGL:DP:AD:CGA_RDP 1/1:.:PASS:68:68,615:68,615:35,35:-615,-68,0:-35,-35,0:38:38,38:0
1 1875858 . CA GG . . NS=1;AN=2 GT:PS:FT:GQ:HQ:EHQ:CGA_CEHQ:GL:CGA_CEGL:DP:AD:CGA_RDP 1/1:.:PASS:35:390,35:390,35:33,33:-390,-35,0:-33,-33,0:23:22,22:1
1 2116593 . CG GC . . NS=1;AN=2 GT:PS:FT:GQ:HQ:EHQ:CGA_CEHQ:GL:CGA_CEGL:DP:AD:CGA_RDP 1/0:.:VQLOW:21:21,21:4,4:0,17:-21,0,-21:0,0,-17:14:2,12:12
I run bgzip
bgzip myVcf.vcf
and then tabix
tabix myVcf.vcf.gz
and get errors that I cannot figure out:
[E::get_intv] Failed to parse TBX_VCF, was wrong -p [type] used?
The offending line was: "#"
[E::get_intv] Failed to parse TBX_VCF, was wrong -p [type] used?
The offending line was: ""
[E::get_intv] Failed to parse TBX_VCF, was wrong -p [type] used?
The offending line was: ""
any ideas would be greatly appreciated.
TBX_VCF
in the error message indicates that the file has been identified (via its filename) as VCF, so adding‑p vcf
will make no difference. Following your steps with the small VCF file as shown works for me. So you would need to attach the actual file for us to help you work out where the blank lines and single-#
-character lines that tabix is seeing are.Thank you John. File is here: https://drive.google.com/file/d/1PqFrl-VpPFvMCQZhEv8baCxkIR2zdK3O/view?usp=sharing
I ran the latest version of snpEff, on a windows machine. I'm running tabix on ubuntu. Appreciate your help.