FriendliAIPartners with NVIDIA on Nemotron 3 for Agentic AI Inference

Redwood City, CA -FriendliAI, an AI inference platform company, announced a partnership with NVIDIA to launch the Nemotron 3 model family, available on FriendliAI's Dedicated Endpoints.Developers can deploy Nemotron 3 models on FriendliAI's inference platform. Highlights include: Up to 13* faster token generation via hybrid Mamba-Transformer MoE architecture and multi-token prediction (MTP) technique MoE routing [...]
The post FriendliAIPartners with NVIDIA on Nemotron 3 for Agentic AI Inference appeared first on Inside HPC & AI News | High-Performance Computing & Artificial Intelligence.