VPTQ: Extreme low-bit Quantization for real LLMs, from Hacker News on 2024-10-21 14:12 (#6RME3)