Two different tricks for fast LLM inference by from Hacker News on 2026-02-15 09:27 (#73K23) Comments