LLMLingua: Compressing Prompts for Faster Inferencing by from Hacker News on 2023-12-18 23:05 (#6H840) Comments