Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse by from Hacker News on 2023-11-23 04:44 (#6GKGP) Comments