VPTQ: Extreme low-bit Quantization for real LLMs from Hacker News on 2024-10-21 14:12 (#6RME3) Comments