I have recently run into the issue of finding relatively recent databases and papers which align to the old reference assembly (HG 19 / GRhC37).
I understand that it is non-trivial to change alignments, but this assembly was last patched in 2013. For novel projects, especially, it should be trivial to use the latest assembly.
Can someone please help me understand why new projects are using the older assembly?
GRCh37, IMO, is the equivalent of IBM mainframe computers, especially with clinical informatics. It is a sequence that works well enough to not be a pain to most research folks. IIRC, even NCBI/dbSNP only made the switch relatively recently. I think GRCh37 is going to be around for a few more years at least. It will take major tools deprecating support for GRCh37 for the forced switch process to start.