Efficient LLM inference solution on Intel GPU by from Hacker News on 2024-01-20 17:11 (#6J0GJ) Comments