Nvidia is set to boost AI server demand with its new Vera Rubin platform, expected to ramp up in the second half of 2026. The platform, succeeding the Blackwell architecture, promises significant improvements, including 10x lower inference token costs and 4x fewer GPUs needed for training mixture-of-experts models. Performance-per-watt is projected to improve by up to 50x over Blackwell.
The Rubin platform is currently in production at TSMC, with six new chips slated for mass production in late 2026. Major cloud providers like AWS, Google Cloud, and Microsoft Azure are preparing to integrate Rubin-based instances, with Microsoft planning extensive deployments. This development could impact TSMC's capacity and intensify competition among chip designers, including AMD and custom silicon from Google and Amazon.
Nvidia's Rubin Platform to Drive AI Server Demand Surge in Late 2026
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
