vLLM: 24x faster LLM serving than HuggingFace Transformers from Hacker News on 2023-06-20 19:17 (#6CAX5)