16 months ago
NCBI and NIAID recently completed a successful codeathon focusing on methods for computing on the scale of millions of records using VCF files as inputs, with SARS-CoV-2 VCFs serving as a case study. This event brought together experts on viral evolution, molecular epidemiology, and population genomics who worked on a variety of projects.
For more details, see our NCBI Insights post. Check out the project results on GitHub.
fyi, I went ahead and took care of the vcf2phylip task for ploidy <= 2 last week, by adding a phylip exporter to plink2 that is functionally identical to vcf2phylip.py while being hundreds of times as fast.
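For anyone unfamiliar with what the vcf2phylip task involves, here is a rough Python sketch of the basic idea for ploidy <= 2. It is not the plink2 or vcf2phylip.py code; the function name `vcf_to_phylip` and the specific choices (SNP sites only, heterozygous calls written as IUPAC ambiguity codes, missing calls written as N, relaxed PHYLIP output) are assumptions made for illustration.

```python
# Illustrative sketch only: not the plink2 exporter or vcf2phylip.py itself.
# Assumes uncompressed VCF text, GT as the first FORMAT field, ploidy <= 2.

# IUPAC nucleotide codes for the set of bases observed in one genotype.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
}


def vcf_to_phylip(vcf_lines):
    """Turn an iterable of VCF lines into a relaxed-PHYLIP string."""
    samples, seqs = [], []
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]
            seqs = [[] for _ in samples]
            continue
        alleles = [fields[3]] + fields[4].split(",")
        # Keep only sites where every allele is a single A/C/G/T base.
        if any(len(a) != 1 or a.upper() not in "ACGT" for a in alleles):
            continue
        for i, entry in enumerate(fields[9:]):
            gt = entry.split(":")[0].replace("|", "/").split("/")
            if len(gt) > 2:
                raise ValueError("ploidy > 2 is not handled in this sketch")
            if "." in gt:
                seqs[i].append("N")  # missing call
            else:
                bases = frozenset(alleles[int(g)].upper() for g in gt)
                seqs[i].append(IUPAC[bases])
    n_sites = len(seqs[0]) if seqs else 0
    out = [f"{len(samples)} {n_sites}"]  # header: sample count, site count
    for name, seq in zip(samples, seqs):
        out.append(f"{name} {''.join(seq)}")
    return "\n".join(out) + "\n"
```

With an open file handle, `print(vcf_to_phylip(fh), end="")` would write the alignment to stdout. A pure-Python loop like this is fine for small inputs but will be slow at the scale of millions of records discussed above.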