GLM-5.1 Tops Open-Source Models in Coding Agent Benchmark

GLM-5.1 has emerged as the leading open-source model in the Artificial Analysis Coding Agent Benchmark, according to a report by Artificial Analysis. The benchmark evaluates model performance on three key tests: SWE-Bench-Pro-Hard-AA, Terminal-Bench v2, and SWE-Atlas-QnA, which simulate real-world programming and technical tasks. While the proprietary Opus 4.7 model secured the top global position, GLM-5.1, operating on Claude Code, led among open-source models, showcasing its advanced capabilities in programming agent scenarios.

出典: 原文を表示

免責事項: Phemexニュースで提供されるコンテンツは、あくまで情報提供を目的としたものであり、第三者の記事から取得した情報の正確性・完全性・信頼性について保証するものではありません。本コンテンツは金融または投資の助言を目的としたものではなく、投資に関する最終判断はご自身での調査と、信頼できる専門家への相談を踏まえて行ってください。

​​こちらもおすすめ​​

こちらもおすすめ