Although hg19 and b37 are considered similar, they have some differences that affect preprocessing for variant calling.
Is there a tool to convert the resources that Mutect uses in its various preprocessing steps, such as the panel of normals or the known-sites files, from b37 to hg19?
I have read about liftover, but I am not sure whether it works for the .vcf files suggested in https://gatk.broadinstitute.org/hc/en-us/articles/360035890811-Resource-bundle such as 1000G_phase1.indels.b37.vcf, Mills_and_1000G_gold_standard.indels.b37.sites.vcf or the Mutect2-WGS-panel-b37.vcf.
You don't need those files for https://gatk.broadinstitute.org/hc/en-us/articles/360037060932-LiftoverVcf-Picard- ; you just need a VCF, a chain file, and a reference (the FASTA of the build you are lifting to).
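To make the three inputs concrete, here is a minimal sketch that only assembles a Picard LiftoverVcf command line. The file names (`b37tohg19.chain`, `ucsc.hg19.fasta`, the output names) and the `picard.jar` path are placeholders I made up for illustration, and the helper `build_liftover_cmd` is hypothetical. It uses Picard's legacy `KEY=value` argument style; check the LiftoverVcf page linked above for the syntax your version expects.

```python
import shlex


def build_liftover_cmd(vcf, chain, target_fasta, out_vcf, reject_vcf,
                       picard_jar="picard.jar"):
    """Return the argument list for a Picard LiftoverVcf run.

    All paths are supplied by the caller; nothing is executed here.
    """
    return [
        "java", "-jar", picard_jar, "LiftoverVcf",
        f"I={vcf}",            # input VCF on the source build (b37)
        f"O={out_vcf}",        # lifted-over VCF on the target build
        f"CHAIN={chain}",      # chain file describing b37 -> hg19
        f"REJECT={reject_vcf}",  # records that could not be lifted
        f"R={target_fasta}",   # reference FASTA of the TARGET build (hg19)
    ]


if __name__ == "__main__":
    # Placeholder file names, for illustration only.
    cmd = build_liftover_cmd(
        vcf="Mills_and_1000G_gold_standard.indels.b37.sites.vcf",
        chain="b37tohg19.chain",
        target_fasta="ucsc.hg19.fasta",
        out_vcf="Mills_and_1000G_gold_standard.indels.hg19.sites.vcf",
        reject_vcf="rejected.vcf",
    )
    # Print a copy-pasteable shell command instead of running it.
    print(shlex.join(cmd))
```

Picard will most likely also want a sequence dictionary (`.dict`) next to the target FASTA, and the REJECT file is worth inspecting afterwards to see how many records were dropped.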
What is b38? Do you mean GRCh38? If that is the case, they are only similar to the extent that they are both human genome builds. Other than that, they are substantially different (which is why they are "major" genome builds).
If you are looking for GRCh37/hg19 GATK resource files, they should be available via the answer here: gatk legacy bundles (where to get Mills_and_1000G_gold_standard.indels.hg19.sites.vcf.gz)
If you need the GATK resource files for GRCh38 they are available here: https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0/
I edited the post; I meant b37. I have read the gatk legacy bundles thread (where to get Mills_and_1000G_gold_standard.indels.hg19.sites.vcf.gz) and came to the conclusion that the b37 files are not compatible with hg19. So is the only way to convert the b37 resources into hg19 format, as Pierre and Raphael mentioned?
There is no difference in the main chromosomes between b37 and hg19 (except chrY and MT, see the table linked below). Unless you are doing something very specific, it may be reasonable to use hg19 files.
See: https://gatk.broadinstitute.org/hc/en-us/articles/360035890711-GRCh37-hg19-b37-humanG1Kv37-Human-Reference-Discrepancies#comparison
I understand. However, having the b37 version would make these two lists the same size and also fix the chr1 vs. 1 naming issue (which I understand can be solved simply by converting). But again, even after converting, the sizes of the lists would differ because of the differences you mentioned (see the Reference Table at https://gatk.broadinstitute.org/hc/en-us/articles/360035890711-GRCh37-hg19-b37-humanG1Kv37-Human-Reference-Discrepancies#comparison). Therefore, since there are no files available for hg19, is the only reasonable option to convert the b37 files to hg19?
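For the naming part only (1 vs. chr1), a rough sketch of what "just converting" could look like on an uncompressed VCF is below. This is a plain rename, not a liftover. The mapping rule (`1`..`22`, `X`, `Y` get a `chr` prefix, `MT` becomes `chrM`) and the function names are my own assumptions for illustration. Since the thread says MT differs between the two references, the sketch drops MT records by default instead of renaming them; contigs with no simple hg19 name (for example unplaced GL contigs) are dropped as well.

```python
import re

# Assumed primary b37 contig names that map to "chr" + name in hg19.
PRIMARY = {str(i) for i in range(1, 23)} | {"X", "Y"}

# Matches header lines such as ##contig=<ID=1,length=249250621>
CONTIG_HEADER = re.compile(r"^(##contig=<ID=)([^,>]+)(.*)$")


def b37_to_hg19_name(contig):
    """Return the hg19-style name, or None if there is no simple rename."""
    if contig in PRIMARY:
        return "chr" + contig
    if contig == "MT":
        return "chrM"
    return None


def rename_vcf_lines(lines, skip=("MT",)):
    """Yield VCF lines with b37 contig names rewritten to hg19 style.

    Contigs listed in `skip`, and contigs without a simple rename,
    are dropped from both the header and the records.
    """
    for line in lines:
        line = line.rstrip("\n")
        header = CONTIG_HEADER.match(line)
        if header:
            name = header.group(2)
            new = None if name in skip else b37_to_hg19_name(name)
            if new is not None:
                yield header.group(1) + new + header.group(3)
            continue
        if line.startswith("#"):
            yield line  # other header lines pass through unchanged
            continue
        chrom, sep, rest = line.partition("\t")
        new = None if chrom in skip else b37_to_hg19_name(chrom)
        if new is not None:
            yield new + sep + rest
```

Pass `skip=("MT", "Y")` if you would rather exclude both contigs mentioned above. After a rename like this, the contig order and header may still not match the hg19 sequence dictionary, so a re-sort against the target dictionary may be needed before GATK accepts the file.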
Try CrossMap (https://crossmap.sourceforge.net/); it works for VCF files.