Gemini's new video model, 'Omni,' has been discovered by users ahead of its official launch, with reports surfacing on Reddit about its capabilities. Users noted that the model, labeled "Powered by Omni," appeared in the Gemini app, offering advanced video generation features. One user praised Omni for its prompt adherence and seamless multi-camera transitions, highlighting its superior audio and ambient sound quality compared to the existing Veo series.
Despite the praise, users reported strict rate limits, with Pro subscribers exhausting 80% of their quota after generating just two videos. Additionally, safeguards still block celebrity likenesses, as tests with Will Smith's likeness failed. The discovery suggests Google may be integrating text, image, and video generation into a single architecture, aligning with DeepMind CEO Hassabis's previous statements about merging Gemini and Veo. An official announcement is anticipated at the Google I/O conference on May 19.
Gemini's 'Omni' Video Model Discovered Ahead of Launch, Praised for Audio Quality
免責事項: Phemexニュースで提供されるコンテンツは、あくまで情報提供を目的としたものであり、第三者の記事から取得した情報の正確性・完全性・信頼性について保証するものではありません。本コンテンツは金融または投資の助言を目的としたものではなく、投資に関する最終判断はご自身での調査と、信頼できる専門家への相談を踏まえて行ってください。
