Hi,
I am trying to ask a query for a single position using tabix, but I am getting multiple variants in result. Interestingly, one variant's position is not same as my query position. I wonder if its expected, or if there is any way to avoid such scenario.
My query is like below:
tabix chr.all.vcf.bgz 9:136131322-136131322
Result looks like below:
9 136127800 MERGED_DEL_2_55992 TCCCCCAGGGAGGCGGGGCGCAGGGATTGCAGTGAGGCCCTGTGCCCAGGCTGGCTGTGCCCTACCTGCGGGAAGAGTCACTCCAGTCCCTCTGGGCTGGTCCAGGTGCAACCACAGTAGGACACAGGTCAACTCCAGTGAAATGTGGAGGGAGGAAGGGTGTGCCTGCCTGCTCCTTCCCCTCCCTTGCAGGGAGGGGCGGTTGCCCTCAGCAACAGAATGCCCACGTGGGATACTGGAAGCTTCAGCTTACCCCACCCCACCCCTCCAGGCCCGGTTTGTCCTGGGCGCAAGGGGCTACTTCAGAGCTGCCAGGCCTCCACAGCAACACATTAAATGTTCTGGAAACTAGGAGATGTGGCACTGCTGTACAACGGTCAGGAATAGCCATCCTGTCCTCCTGACCCGGTGAAAACCAGCTTCTGCTGGGAAGGAGCATGGGGTGGCGCCTGGCTTTTGGAGTCAGGAAGAACCAGGCTTAAGTAATAGAACTGCCTGACCTCCAGCTTGTCTCTTCAGCTCCCAGGTCTGTGCTCGATTTGGGATTATTGGGTGGGGCACATAAGAAGCTGCGTTCTGTGCTCTCAGGGTGCTGGACGCTGTTTCAGGTACCAGTACACACCAGAGGGAAGAGAGTGCCTGATGGCATGGTGATTGATTTTAGTGGAGACAGCTAGACACTAAACCAGGAGTTCTGTCATGCCAGTTGGTGATAAATTGATTTCTTGAAGATTTTTCCACTTCCAAGCAAGGTAGAATAGATGCACCTGTCCCCACGCCCTACACTAAGGACAGTTAAAATCTCTGTATATTTCCCTGGGTATTACACATACATATATAACTACATACATAAACGTATGTATATCAAACACTAAAAGGTGGAGAGAAGAAGGCAGACCATGTAGGGACCTTGGGACCTGAGGAATGACATGACAGGAGTTCCCTGGGTTTTCTTTCTGCCTCATATATCTGTGACTGTGTGCTGGAAAAGCCAGCAACACGAAACACCAAAAGACACAGACAAAAACCAACAACAACAAACCCAACAAGTTGGCCCTCAGCCAAAATAATTAGGCAAGAGGAAGAAATAAAAGGTATCCAAATTGGAAAGGAAGAACTTAAATTGTCCCTGTTTACAGATGACATGATCTTATAGAAAACCCTAAAGATTCCACCAATAAGATGTTAGAATAAATGAAATCAGTAAAGTTGCAGGATATAAAATTAACATACAACAATCTGTAGCATTTCTATACCCTAACAAAGAAATCAAGAAAACAATGTCATTTACAATAGCTACAAAAAATACTTGCAAATAAATCCAACCAAGGAAGTGAAAGATCTGTATGCTGAAAACTATAAAACATTTGTGAAAGAAATTGAAGATACAAATAATTAGAAAGCTGTCCCATATCTGAGATCAGGTGCCAAAAAAAAGACATCCCATGTTCATGAATTGGAAGAATTAATATTGTTAAAATGTCCATACAACCCCAAACAATCTACAGATTCAATGCAATCCCTATCAAAATTTCATTGACATTTTTCACTGAAATAAATAAAATAATCCTAAAATTCGTATGAGGGAGTAACACCTGCTCCACAGTGGCACAAAGCACTGTCAGGTGCCACACACAGCCTGTTGGGACATGTGGGCCACAGTACTTCCCCACCTAAAGTACTGTAGAGGCCCAGGAGACATGAGCCAGGGCCATTTTAACTGTCAGTAATGGCCACAAAGACCCCTAAATAGCCAAAGCAATCTTAAGCAAAAAGAACAAAGCTGGAGACAATACACTACCTAACTTCAAGATATACTATAAAGCTATAATAATCAAAACATCATGGTATTGGCATAAAAACAAACAGACCAATGAAACAGAATAGAGAGCCCGGAAGTGAATCCATGCATTTACGGTCAACTGATTTTTGACAGAAATGCCAAGAAACACAATGTGGAAACGACAGTCTGTTCAATAAATGATGTTGGGGCCAGGTGCAGTGGCTCATGCTTATAATCCTAGCACTTTGGGATGCCGAGGTGGAAGGACCACTTGAGGCCTCCCTCCAGGCTCCGGAAGAGACCTCCTCCATGATCCCTTGTCTAAGGGGAAGGTTCCTCAGGACCTTACCGTGGGGGCTGAAGGTGGCCACCCCTCCAGGAACTTATGCCCCAGGCGCTGAATTTGGGCTGCCTAAGTCTGTGTGCGTGAGTCTGTGTTTGTGTGCATGTCTGCATGTCTGTGTGTTTGCATGCATGTCTGTGTGTCTGTGTGGTCTATGTGTCTGTGTGTACACTTCTGTATGTCTTTCTCTGTGCATTTTTGCATGTGTCTCCATGTGTCTCTGTGCATGTCTGTGTGTCTATATGTCTGTGTCTTTGTATCTGTGTATCTGTGTCTGTGTGTCTTTGCGTGTCTGGTGTATGTCTGTGTGTGTGTGTATGTCTGTGTATGTAACGGTGTGTCTCTGTGGCGGGGGGTGTGTGTGTGATTGTGTGTGTGTGTGTGTCTGTGTATGTAATGGTGTGTCTCTGTGGCGGGGAGGGGCTGTGTGTGTCTTTCTGTGTATGTGTGTGTAACGGTGTGTCTCTATGGCCGGGAGGGGGTATCTGTGATTGTGTGTCTGTGTGTGTCTGTGTATGTAATGGTGTGTCTCTGTGGCGGGGAGGGGCTGTGTGTGATTGTGTGTCTGTGTGTGTCTGTGTATGTAATGGTGTGTCTCTGTGGCGGGGAGGGTGTGTGTGTGATTTTGTGTCTGTGTGTGTCTTTCTGTGTGTGTCTGTGTATGTAATGGTGTGTCTCTGTGGCGGGGAGGGGCTGCGTGTGATTGTGTGTCTGTGTGTGTCTTTCTGTGTATGTGTGTGTAATGGTGTGTTTCTGTGGCCGGAAGGGCGTATCTGCGATTGCGTGTCTGTGTATGTAATGGTGTGTCTCTGTGGCGGGGAGGGTGTGTGTGTGATTTTGTGTCTGTGTCTTTCTGTGTGTGTCTGTGTATGTAATGGTGTCTCTGTGGCGGGGAGGGTGTGTGTGTGATTTTGTGTCTGTGTGTGTCTTTCTGTGTGTGTCTGTGTATGTAATGGTGTCTCTGTGGTGGGGAGGGGGTGTGTGTGATTTGAGGTGGGGACGGGGCCTAGGCTTCAGTTACTCACAACAGGACGGACAAAGGAAACAGAGTTTACCCGTTCTGCTAAAACCAAGGGCGGGAGGGGGACGGGGCTGCCGGCAGCCCTCCCAGAGCCCCTGGCAGCCGCTCACGGGTTCCGGACCGCCTGGTGGTTCTTGGGCACCGCAGTGAACCTCAGCTTCCTCAGGACGGCGGGCCAGCCCAGCAGCTGCTGGTCCCACAAGTACTCGGGGGAGAGCACCTTGGTGGGTTTGTGGCGCAGCAGGTACTTGTTCAGGTGGCTCTCGTCGTGCCACACGGCCTCGATGCCGTTGGCCTGGTCGACCATCATGGCCTGGTGGCAGGCCCTGGTGAGCCGCTGCACCTCTTGCACCGACCCCCCGAAGAACCCCCCCAGGTAGTAGAAATCGCCCTCGTCCTTGGGGATGTAGGCCTGGGACTGGGGCCGGCGCTCGTAGGTGAAGGCCTCCCGGCTGCTTCCGTAGAAGCCGGGGTGCAGGGTGCCGAACAGCGGAGTCAGGATCTCCACGCCCACGTGGTCGCGGAACTCCATGTCCACGTCCACGCACACCAGGTAATCCACCTCGCTGAGGAAGCGCCGCTCGCAGAAGTCACTGATCATCTCCATGCGGCGCATGGACACGTCCTGCCAGCGCTTGTAGGCGCGCACCTCCAGCACTGACAGCTGCCGACCGGTCCCCAGCGTCACGCGGGGCACCGCGGCCGGCTGGTCGGTGAAGACATAGTAGTGGACACGGTGGCCCACCATGAAGTGCTTCTCCGCCGTCTCCAGGAACAGCTTCAGGAAAGCCACGTATCTGCAAGGCAGGCGGACGGGGGCTGGGGGAGCCGCCGGCCGTGCACCCCTGGGCTGCAGGAGGCCCGTCCTGCACCCGCCCGCCAGCGGCCATTGGAAGGCTTAGAGCAGCAGATGCACCACGTTCTCCTGCCCTGTCCTGAGCGAGTCCTCGGGCTGCGATTCACTTCATCCTCTTCCCAGCGATGGGGGACCACCAGCACCCCCTCTTACTAAGGAGGGCTGAGGGCAGGTGGCTGGAGGCTGGTAGCAGGCCGCAGGCTGGCGTCTGCTCACTCCCCCTCAGCCTGGCCTGAGCCACGCCTCCCCACGCAGCTGCCCCTCTTATGGCCAGGCCGGCCACGTGCTCCCTCATTATAAGCTGCACGCGAGGCCTCCACACACCCGCCTCTGCACCCTAGAGCTTCCTCCCTCCAGGCTTGAACTGCACCTATTCCTAAGAGTAAGTCATTCCTGGCCTCCGCCACTGTCGCTGGCCCAGCTGCCCACAGCTGCCGAGAAGTCAAGTATGTGTCTGCGGTTGCCTGGCTAGCTCCCTCTCTGGCCTGGCCCAGAGTCCCAGGGCCTTGTGGGTCAGCCACTTCCTTTGGTGTCTGGGGCCAACTGCTTTGCCTGCCCCACCTACATCTGACAGAGAAGTGACCACGGCTCTGCCAGCATCCTCTTTCTAGGGTCCAAGGACAGCAAACAGGTGTCCCCCTCCTGCTATCTCTGGTCAGTGAGCAGGAAACATCTGGAGCCTTGTATTGAGGGGGTGGCTCAGCATGACGGCCGGCCACAGTTACAGAGAGGAGGGGGCAGCAGAAGCCACCATCCCTGGGTGAGACGCAGCCTCTGGAGAAGGAGCTGGGTTTTACCGACCTGGCGAGCCCACGAGCCCACGAGCCCACATGAGCTCAGTAAGATGCTGCATGAATGACCTTTCCCATCTACCCTCTGGGAGGACAAGGCTGGCCGCCACCCCACTCTGTCTTGAACACAAGGAGAGACCTCAATGTCCACAGTCACTCGCCACTGCCTGGGTCTCTACCCTCGGCCACCTCACTGACTTACTTCTTGATGGCAAACACAGTTAACCCAATGGTGGTGTTCTGGAGCCTGAACTGCTCGTTGAGGATGTCGATGTTGAATGTGCCCTCCCAGACAATGGGAGCCAGCCAAGGGGTACCACGAGGACATCCTTCCTACTGCACATGGAGAGAGGCGTGCGGTCACATGGAGCTGGCAGGGTGCCACCCACATGCGCCTCTGGCACACGGCCGCCCCCACCTGGAAACTCCACTCAGCTTCTGCCTCCTCTGACCACCCTTCCAGAGGCAGCCGCCCTTCCCCGGGAAACCAACCAGAGGCAAATGCGACTCCAACCGGGCAAATCATTCCCAGCCCTCCCTCAACATTGGACCTGTGGGAACACACAGCAAGCTGAGCTTTGCTGGCAAAGAGATAGGAACAAACCCTCCCCAGCACCCAACCCCCGCTGCCCCTCCCCAGGTAGGAGGTACCTATCAGGCCTTTGCAGGGGCTTTGGAGAACAAAGGGACAGGAAACAAGAGACGCAAGTCAGAGAAAGCAAAGGGAAAGAGGACAGCCATGTGGGCCTCTGAATTCAGATGTCAGGAGAATCTGAGAGGAGAGAACGGGGAAGCAGCCCCAACTGAGATTTACATCAAGGAAACCGCCCTCTAATACCTTCAGAACAGCCCCTTGAGCTGCGTTCAGTTTCAGTGTCAGTAACTTTACTCACCACGGTGTCAGCACCTTTGACTGGGGGTAGACCATCCTGCAAGCACAAAGCGCCGCCACGTGAGTTTGCATGGAAAGCGTGGGATGCAGGTAAGCAGGGGGTGTGCACAGCCGCTGAACCATGACTGGGCATTGA T . info4;maf05 EXP_FREQ_A1=0.000;IMPINFO=0.000;CERTAINTY=1.000;TYPE=0;MISS=0;HW=1 GL:GT:DS 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 9 136131322 rs8176746 G T . PASS EXP_FREQ_A1=0.100;IMPINFO=1.000;CERTAINTY=1.000;TYPE=2;MISS=0;HW=0.60587 GL:GT:DS 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 0,1,0:0/1:1 1,0,0:0/0:0 1,0,0:0/0:0 0,1,0:0/1:1 1,0,0:0/0:0 1,0,0:0/0:0 0,1,0:0/1:1 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 1,0,0:0/0:0 0,1,0:0/1:1
I just realized that the ref and alt alleles' lengths are 5822 and 1, respectively in the first variant. And first variant's position (136127800) is within 5822 bp from my query position (136131322). Is it the reason I am getting multiple variants for my query position?
I am actually interested about SNPs. I would appreciate if anyone can suggest a way get only snps from tabix, without post-processing tabix result (I can do this).