Running large language models like ChatGPT on a single GPU by from Hacker News on 2023-02-20 16:55 (#691SC) Comments