by Brian Wang from NextBigFuture.com on (#6RNV7)
New Linear-complexity Multiplication (L-Mul) algorithm claims it can reduce energy costs by 95% for element-wise tensor multiplications and 80% for dot products in large language models. It maintains or even improving precision compared to 8-bit floating point operations. By replacing complex floating-point multiplication with integer addition Solution in this Paper - Approximates floating-point multiplication using ... Read more