KVarN: Native vLLM KV-cache quantization back end by Huawei by from Hacker News on 2026-06-04 15:18 (#7636K) Comments