So first I downloaded files like vcfBeta-NA10851-200-37-ASM.vcf.bz2 from the complete genomics website. Then I had to convert it to gz and index by tabix for fast extraction of variation in chromosome positions. I realized that I have to do the same for 69 genomes which is tiresome.
Then I downloaded CompletePublicGenomes69genomesall_testvariants.tsv.bz2. After unzipping, the file size is 7.2 GB. Let me know how can I work with this file since the format is unknown to me. Thank you.
Thank you for your suggestion. I actually tried sed which seemed to be too slow for such huge files. I am worndering if it would be possible to convert it to vcf and index by tabix for fastest access to chromosome regions.
You do not need to convert to VCF. Tabix will happily index any sorted, bgzipped file given the correct arguments for chromosome, start, and end columns.