Google has unveiled Gemini Omni at its I/O 2026 event, a groundbreaking multimodal generative model designed to enhance AI-driven video creation and editing. This new model integrates Google's Gemini reasoning backbone with its media engines, enabling the creation of videos from simple inputs. DeepMind CEO Demis Hassabis described Omni as a step towards artificial general intelligence, highlighting its ability to create content from any input.
Gemini Omni Flash, the first public version, will be available through Google's AI filmmaking platform, Flow, and Flow Music for music projects. The model supports conversational editing, allowing users to make broad changes using natural language. Demonstrations included a claymation explainer and a selfie video with added visual elements. For the crypto and Web3 sectors, Omni could transform NFT production and on-chain storytelling, while also raising challenges in provenance and content moderation.
Google Launches Gemini Omni to Revolutionize AI Video Creation
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
