I have multiple vcf of CASES and CONTROLS variations annotated by VEP, SNPEff, SnpSift.
first pair vcf -> only variations| CASES and CONTROLS
second pair vcf -> variations + SnpEff | CASES and CONTROLS
third pair vcf-> variations + SnpEff+VEP+SnpSIFT| CASES and CONTROLS | at so on
Because it have non-overlapping samples, i tried to merge corresponding vcf of CASE and CONTROLS into one file with bcftools merge.
bgzip -c file.vcf > file.vcf.gz
tabix -p vcf file.vcf.gz
bcftools merge -o merged.vcf.gz -Oz file.vcf.gz file_2.vcf.gz
Unfortunatelly in files with SnpSift annotations i got this error:
Error at chr1:20724637: wrong number of fields in dbNSFP_ada_score?
Please anybody knows how to solved this error or what does it mean?
what is the output of the following cmds:
grep header:
INFO=<ID=dbNSFP_ada_score,Number=A,Type=String,Description="Field 'ada_score' from dbNSFP">
INFO=<ID=dbNSFP_ada_score,Number=A,Type=String,Description="Field 'ada_score' from dbNSFP">
-G on chr1:
CHROM POS ID REF ALT QUAL FILTER INFO
chr1 20724637 rs1280663306;rs1445195248 TGGGAGGGAGGGAGAGAGGT TGGGAGGGAGAGAGGT,TGGGAGGGAGGGAGGGAGGT,TGGGAGGGAGGGAGAGAGGG,TGGGGGGGAGGGAGAGAGGT,TGGGTGGGAGGGAGAGAGGT,TGGGAGAGAGGGAGAGAGGT 2.42927 . ANN=TGGGGGGGAGGGAGAGAGGT|splice_region_variant&intron_variant|LOW|SH2D5|SH2D5
...SNV|1||||||||||||||||||||||||||||||||||||||||||||;dbNSFP_ada_score=1.43020658327008E-4,0.00157457299236331,5.16496450143413E-5;dbNSFP_rf_score=0.016,0.054,0.0;dbNSFP_PHRED=5.632,0.077,6.365,7.459,2.716
CHROM POS ID REF ALT QUAL FILTER INFO
chr1 20724637 rs1280663306;rs1445195248 TGGGAGGGAGGGAGAGAGGT TGGGAGGGAGAGAGGT,TGGGAGGGAGGGAGGGAGGT,TGGGAGGGAGGGAGAGAGGG,TGGGGGGGAGGGAGAGAGGT,TGGGTGGGAGGGAGAGAGGT,TGGGAGAGAGGGAGAGAGGT 2.42927 . ANN=TGGGGGGGAGGGAGAGAGGT|splice_region_variant&intron_variant|LOW|SH2D5|SH2D5|transcript|NM_001103161.2|protein_coding|5/9|..
..||||||||||||||||||||||||||;dbNSFP_ada_score=1.43020658327008E-4,0.00157457299236331,5.16496450143413E-5;dbNSFP_rf_score=0.016,0.054,0.0;dbNSFP_PHRED=5.632,0.077,6.365,7.459,2.716
I had to cut it because it wouldnt be readeable and i think it would be too long for response. I ll try do add it into question.
Unfortunately content is over limit. Still only one differrence i have there dbNSFP annotations from others is separated by ; instead |
My last idea is that it could be only some difference beetween CASE and CONTROLS files.