Nano-vLLM: How a vLLM-style inference engine works from Hacker News on 2026-02-02 12:52 (#73929) Comments