Gemini 3 Flash has emerged as the leading AI model in the OpenClaw agent task, achieving a 95.1% success rate according to the PinchBench benchmark test. The evaluation, highlighted by SlowMist CISO 23pads on the X platform, places minimax-m2.1 and kimi-k2.5 in second and third positions with success rates of 93.6% and 93.4%, respectively. Claude Sonnet 4.5 follows with 92.7%, while GPT-4o records an 85.2% success rate.
Gemini 3 Flash Tops OpenClaw Task Performance with 95.1% Success Rate
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
