NVIDIA has launched the Cosmos 3 world model, making two versions available for download: Super and Nano. The Super version, with 64.6 billion parameters, is designed for applications requiring high physical accuracy, such as post-training robotics and autonomous driving. The Nano version, featuring 15.7 billion parameters, is optimized for low-latency scenarios like high-quality video and action reasoning. Both versions are accessible on Hugging Face and build.nvidia.com, supporting deployment as NVIDIA NIM microservices.
Cosmos 3 is a multimodal foundational world model for physical AI, utilizing a Mixture of Transformers architecture to understand and generate text, images, video, environmental sounds, and actions. NVIDIA describes it as the first fully open multimodal model, allowing developers to download, fine-tune, and convert it into proprietary models. An Edge version for real-time inference is expected soon.
NVIDIA Releases Cosmos 3 World Model with Super and Nano Versions
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
