With this command, I only get a vcf file containing numbers:
tabix -fh ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20110521/ALL.chr11.phase1_release_v3.20101123.snps_indels_svs.genotypes.vcf.gz 11:62379194-62382592 >1000.vcf
Could you tell me what is going on wrong or how to interpret the output (first 11 columns from awk)?
11 62379317 rs181324718 G T 100 PASS ERATE=0.0004;LDAF=0.0013;AA=G;AN=2184;VT=SNP;THETA=0.0006;RSQ=0.7300;SNPSOURCE=LOWCOV;AC=2;AVGPOST=0.9992;AF=0.0009;ASN_AF=0.0035 GT:DS:GL 0|0:0.000:-0.01,-1.79,-5.00 0|0:0.000:-0.04,-1.06,-5.00
11 62379455 rs146374152 C T 100 PASS ERATE=0.0004;AA=C;AN=2184;VT=SNP;RSQ=0.5436;THETA=0.0056;SNPSOURCE=LOWCOV;AC=1;AVGPOST=0.9992;LDAF=0.0008;AF=0.0005;AFR_AF=0.0020 GT:DS:GL 0|0:0.000:-0.10,-0.69,-4.70 0|0:0.000:-0.00,-2.48,-5.00
11 62379545 rs139680986 C T 100 PASS ERATE=0.0004;AA=C;AN=2184;VT=SNP;THETA=0.0006;LDAF=0.0007;SNPSOURCE=LOWCOV;AC=1;RSQ=0.6432;AVGPOST=0.9995;AF=0.0005;ASN_AF=0.0017 GT:DS:GL 0|0:0.000:-0.01,-1.49,-5.00 0|0:0.000:-0.00,-2.59,-5.00
Can you show us some of the output?
There should be some # in the "head" of this vcf file to walk you through these values.. normally as one would hope the vcf file will have the variants calls for all the samples in columns. Perhaps a column count, row count and walking through the # should offer some insight.