
Gemini Omni: Conversational Video Creation and Multimodal Editing
Gemini Omni is a conversational AI model that enables sophisticated video creation and editing by combining multimodal inputs with real-world reasoning.
AI systems that learn to simulate and predict the dynamics of environments, including physics, interactions, and spatial consistency, enabling real-time generation of interactive worlds and applications in robotics, gaming, and AGI research.

Gemini Omni is a conversational AI model that enables sophisticated video creation and editing by combining multimodal inputs with real-world reasoning.

ARC-AGI-3 is an interactive benchmark designed to measure AGI by testing an agent's ability to learn and adapt as efficiently as a human.

Google’s Project Genie lets AI Ultra users create and explore real-time, interactive worlds powered by the Genie 3 world model.
World models now mean assets, simulators, or brains—three different layers of the same aim to give machines structured understanding beyond next-token prediction.

An open-source, world-consistent RGB-D video generator that turns a single image into controllable, long-range 3D scene explorations with state-of-the-art performance.

AI is chasing coherent internal world models to move beyond brittle heuristics and achieve robust, reliable reasoning.