
AI researchers run AI chatbots at a lightbulb-esque 13 watts with no performance loss — stripping matrix multiplication from LLMs yields massive gains

from Tomshardware (#6NSM5)
Eliminating matrix multiplication from LLM processing can massively increase performance per watt with the right optimizations, researchers at UC Santa Cruz demonstrate. It remains to be seen how broadly the approach applies to AI in general.
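The core idea behind such "MatMul-free" designs is usually to constrain weights to the ternary values -1, 0, and +1, so that every dense layer reduces to signed additions of inputs instead of multiply-accumulates. The snippet below is a minimal NumPy sketch of that idea under that assumption; `ternary_quantize` and `matmul_free_linear` are illustrative names, not the researchers' code, and the reported 13 W figure also relies on training-time quantization and custom FPGA hardware rather than a post-hoc rewrite like this.

```python
import numpy as np

def ternary_quantize(w: np.ndarray) -> np.ndarray:
    """Round weights to {-1, 0, +1} (illustrative post-hoc quantization;
    real MatMul-free models are trained with quantization-aware methods)."""
    scale = np.mean(np.abs(w)) + 1e-8
    return np.clip(np.round(w / scale), -1, 1)

def matmul_free_linear(x: np.ndarray, w_ternary: np.ndarray) -> np.ndarray:
    """Dense layer using only additions and subtractions.

    Because every weight is -1, 0, or +1, each output element is a signed
    sum of selected inputs -- no multiplications are required."""
    out = np.zeros((x.shape[0], w_ternary.shape[1]), dtype=x.dtype)
    for j in range(w_ternary.shape[1]):
        pos = x[:, w_ternary[:, j] == 1].sum(axis=1)
        neg = x[:, w_ternary[:, j] == -1].sum(axis=1)
        out[:, j] = pos - neg
    return out

# Example: a 4-token batch through a 16->8 "linear" layer with no matmul.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 16)).astype(np.float32)
w = ternary_quantize(rng.standard_normal((16, 8)).astype(np.float32))
print(matmul_free_linear(x, w).shape)  # (4, 8)
```

On hardware, the win comes from the fact that an adder tree is far cheaper in area and energy than a multiplier array, which is why the approach pays off most on accelerators built for it rather than on stock GPUs.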
Source RSS or Atom Feed
Feed Location https://www.tomshardware.com/feeds/all
Feed Title Tomshardware
Feed Link https://www.tomshardware.com/