Accelerating Gemma 4: faster inference with multi-token prediction drafters from Hacker News on 2026-05-05 16:14 (#75DPT) Comments