Llamafile 0.8.1: GPU LLM Offloading Now Works With More AMD GPUs
It was just a few days ago that Llamafile 0.8 was released with LLaMA 3 and Grok support along with faster F16 performance. Now this Mozilla project for self-contained, easily redistributable large language model (LLM) deployments is out with a new release...
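For those who have not tried it, a llamafile is a single self-contained executable that bundles model weights with a llama.cpp-based runtime, and GPU offloading is requested at the command line. A minimal sketch of how offloading to an AMD GPU might be invoked (the model filename below is hypothetical, and the flags assume llamafile's documented --gpu and -ngl options):

    # Hypothetical model file; --gpu amd requests AMD offloading,
    # -ngl sets how many layers to push onto the GPU.
    ./mistral-7b-instruct.llamafile --gpu amd -ngl 999 -p "Why is the sky blue?"

The 0.8.1 release broadens which AMD GPUs this offloading path works with; when no supported GPU is found, llamafile falls back to CPU inference.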