Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint from Hacker News on 2026-05-18 17:56 (#75PY4) Comments