Tired of video editing? Google’s Gemini Omni changes scenes when you ask

May 24, 2026

What you need to know

  • Google unveiled Gemini Omni, a new multimodal AI model built to generate and edit videos using text, images, audio, and video inputs.
  • The model is designed to be context-aware and physics-aware, helping generated videos look more realistic and coherent over longer creative sessions.
  • Gemini Omni remembers previous instructions during multi-step edits, which could make iterative video creation much smoother.

Gemini is going to be much more than a chatbot. During its I/O event today, Google announced a new multimodal AI model called Gemini Omni, which is designed to help you create and edit videos from just about any kind of input you give it.

According to Google, Gemini Omni can combine text, image, audio, and video references into fully generated clips that are designed to stay coherent across scenes and edits. This means the AI no longer relies on text prompts alone.
