Sparse LLM Inference on CPU: 75% fewer parameters, from Hacker News on 2023-10-19 03:13 (#6FPKQ)