Cerebras Systems has achieved a significant milestone by serving Moonshot AI's Kimi K2.6 model at 981 output tokens per second, outperforming the next-best GPU cloud provider by 6.7 times. This performance was independently verified by Artificial Analysis. The Kimi K2.6, a 1-trillion-parameter Mixture-of-Experts model, was released on April 20, 2026, and features multimodal and agentic capabilities.
The Cerebras-powered setup demonstrated a 29x improvement in end-to-end latency on a representative coding workload compared to the official Kimi endpoint. This achievement underscores the capabilities of Cerebras's Wafer-Scale Engine, which offers over 200 times the bandwidth of NVIDIA's NVLink. Following its IPO in May 2026, valued at $95 billion, Cerebras is proving its hardware's ability to efficiently handle large AI models.
Cerebras Systems Outpaces GPU Cloud with 981 Tokens/Second on Kimi K2.6 Model
免責事項: Phemexニュースで提供されるコンテンツは、あくまで情報提供を目的としたものであり、第三者の記事から取得した情報の正確性・完全性・信頼性について保証するものではありません。本コンテンツは金融または投資の助言を目的としたものではなく、投資に関する最終判断はご自身での調査と、信頼できる専門家への相談を踏まえて行ってください。
