Does DeepSeek Impact the Future of AI Data Centers?
by Brian Wang from NextBigFuture.com on (#6VAG0)
China's DeepSeek has cut the cost of AI with innovations such as mixture of experts (MoE) and fine-grained expert segmentation, which significantly improve efficiency in large language models. The DeepSeek model activates only about 37 billion of its 600+ billion total parameters during inference, compared to models like Llama that activate all ...
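To make the sparse-activation idea concrete, here is a minimal sketch of top-k expert routing, the general mechanism behind mixture-of-experts layers. It is not DeepSeek's actual router: the function names, the 8-expert toy layer, the choice of k=2, and the random gate scores are all assumptions for illustration. The last line uses the figures quoted above, taking 600 billion as a lower bound for "600+".

```python
import math
import random


def top_k_route(scores, k):
    """Pick the k highest-scoring experts and softmax-normalize their scores.

    `scores` is a list of gate scores, one per expert (hypothetical values).
    Returns (expert_indices, weights) where the weights sum to 1.
    """
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in top)  # subtract the max for numerical stability
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    return top, [e / total for e in exps]


def active_fraction(active_params, total_params):
    """Share of parameters that take part in processing one token."""
    return active_params / total_params


# Toy layer (assumed sizes): 8 experts, only 2 of them run for this token.
rng = random.Random(0)
gate_scores = [rng.gauss(0.0, 1.0) for _ in range(8)]
experts, weights = top_k_route(gate_scores, k=2)
print("experts used:", experts)
print("mixing weights:", [round(w, 3) for w in weights])

# Figures from the text: about 37B active out of at least 600B total.
print(f"active share: at most about {active_fraction(37e9, 600e9):.1%}")
```

Under these assumptions, only the selected experts do any work for a given token, which is why the compute per token tracks the roughly 37 billion active parameters rather than the full parameter count.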