Luo Fuli, head of Xiaomi's large model team, announced a significant shift in the large model landscape from the Chat era to the Agent era, emphasizing post-training. This transition has altered compute allocation strategies, with the pre-training to post-training compute ratio now reaching 1:1 among leading teams. Previously, the ratio was 3:5:1 during the Chat era. Luo noted that the focus is now on scaling reinforcement learning for Agents, necessitating changes in system architecture to support complex workflows and heterogeneous cluster scheduling.
Xiaomi's Luo Fuli Highlights Shift to Post-Training Era in Large Models
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
