LLM Treeof Life

Veo 2

Google DeepMind · December 2024

● activeCloseddiffusionmultimodal

Description

Google DeepMind's video generation model capable of producing high-quality 4K resolution videos. Features improved understanding of real-world physics (objects fall, liquids flow, light reflects realistically) and offers cinematic controls like camera angles, depth of field, and lens effects.

Key Innovations

Diffusion

DiffusionGenerates outputs by gradually denoising random noise into coherent images/audio. The backbone of Stable Diffusion and DALL·E.

Text-to-Video

Text-to-VideoGenerating video clips from text descriptions — one of the newest and most compute-intensive AI capabilities.

External Links

More from Google Gemini

Gemini 1.02023-12 · —

Gemini 1.5 Pro2024-02 · —

Gemini 2.02024-12 · —

Gemini 2.5 Pro2025-03 · —

Gemini 3.1 Pro2026-02 · ~1T (MoE)

Gemini 3.5 Flash2026-05 · —

Gemini 3.5 Pro2026-05 · —

Imagen 22023-12 · —

Imagen 32024-06 · —

Gemini 2.0 Flash2024-12 · —

PreviousGemini 2.0 Flash

NextGemini 2.5 Pro