Entering edit mode
10.1 years ago
Lee Katz
★
3.2k
Hi Biostars, I was wondering how I might backfill some information into the VCF format. Here are the first three lines of a VCF pooled file I created with bcftools merge
.
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT Sample1 2:Sample1 3:Sample1 4:Sample1 5:Sample1 6:Sample1 7:Sample
NC001416 1 . G . 0 PASS ADP=3;WT=0;HET=0;HOM=0;NC=1 GT:GQ:SDP:DP:RD:AD:RBQ:ABQ:RDF:RDR:ADF:ADR ./.:.:3:.:.:.:.:.:.:.:.:
NC001416 2 . G . 0 PASS ADP=3;WT=0;HET=0;HOM=0;NC=1 GT:GQ:SDP:DP:RD:AD:RBQ:ABQ:RDF:RDR:ADF:ADR ./.:.:3:.:.:.:.:.:.:.:.:
NC001416 3 . G . 0 PASS ADP=6;WT=0;HET=0;HOM=0;NC=1 GT:GQ:SDP:DP:RD:AD:RBQ:ABQ:RDF:RDR:ADF:ADR ./.:.:6:.:.:.:.:.:.:.:.:
One thing I would want to backfill is the DP attribute for each sample. Would I use bcftools annotate? How? Is there an example or tutorial out there I could follow? I have all the original bam files, so I can supply that information to a given script.
Have you tried the Squaring off utility of bcbio.variation.recall?
I think that
bcbio.variation.recall
does the same thing thatbcftools merge
does. Right? I am not sure if I can use it to backfill information such as DP (depth).