Diffusion Models

Generative models that learn to denoise data through iterative refinement steps, widely used for image generation, video synthesis, and other creative AI applications.

Reading List

Products & Announcements

Krea 2: A Foundation Model for Creative Exploration and Control

Jun 24, 2026374

Krea 2 is an open-weights image foundation model series optimized for creative steerability and aesthetic diversity through a sophisticated multi-stage training and infrastructure stack.

AI Image Generation Foundation Models AI Infrastructure Diffusion Models AI Creativity

Under the Hood

I-DLM: Matching Autoregressive Quality with Parallel Diffusion Speed

Apr 14, 2026267

I-DLM achieves autoregressive-level quality and significantly higher throughput by incorporating a self-verification mechanism into parallel diffusion decoding.

Diffusion Models LLM Inference AI Architecture LLM Reasoning

Products & Announcements

Skyfall-GS: Real-Time City-Scale 3D from Satellite Images via Diffusion-Guided Refinement

Nov 3, 2025147

Skyfall-GS fuses satellite imagery with diffusion-driven iterative refinement to produce real-time, city-scale 3D scenes with superior geometry and textures—without 3D annotations.

Gaussian Splatting Computer Vision 3D Modeling Diffusion Models Satellite Imagery

Under the Hood

Single‑Pass Image Editing Showdown: Style Wins, Precision Still Hard

Oct 28, 2025342

Image editors are improving, but precise, localized, constraint-respecting edits remain the Achilles’ heel—even the best models stumble on spatial swaps and selective removals.

AI Benchmarks AI Image Generation AI Image Editing Diffusion Models

Products & Announcements

Ovi: Open-Source Text-to-Audio-Video Generation with Efficient Inference

Oct 22, 2025314

An open-source, configurable system for synchronized text-conditioned video and audio generation that runs on modest GPUs via quantization and parallelism.

AI Video Generation Multimodal AI Open Source Diffusion Models

Under the Hood

Turning BERT’s MLM Into a Text Diffusion Generator

Oct 20, 2025455

BERT-style MLM is a single-step text diffusion process, and extending it to multiple masking steps turns RoBERTa into a workable text generator.

Diffusion Models Natural Language Processing Text Generation Transformer Models

Under the Hood

HunyuanWorld-Voyager: World-Consistent RGB-D Video and 3D from a Single Image

Sep 3, 2025322

An open-source, world-consistent RGB-D video generator that turns a single image into controllable, long-range 3D scene explorations with state-of-the-art performance.

Diffusion Models Computer Vision 3D Modeling World Models AI Video Generation

Under the Hood

Sharing Early Diffusion Steps Across Similar Prompts for Efficient Text-to-Image Generation

Sep 2, 2025

Share early diffusion steps across similar prompts to generate image sets faster and better, without retraining.

Diffusion Models AI Image Generation Algorithms & Optimization