Stable Diffusion 1.5

Stability AI · August 2022

● activeOpen Weightdiffusionimage

Parameters~860M

Why It Matters

Broke open the AI image generation field by being the first high-quality model anyone could download and run for free, creating an entire ecosystem of tools, extensions, and custom models.

Description

The open-source model that democratized AI image generation. Uses a technique called latent diffusion — generating images by working in a compressed mathematical space rather than pixel-by-pixel, which made it fast enough to run on consumer GPUs. Trained on billions of image-text pairs from the LAION-5B dataset, it spawned a massive community of artists, developers, and hobbyists.

Notable Milestones

▸Enabled local AI image generation on consumer hardware for the first time
▸Spawned thousands of community-created custom models and extensions
▸Foundation for ControlNet, LoRA fine-tuning, and other community innovations

Key Innovations

Diffusion

DiffusionGenerates outputs by gradually denoising random noise into coherent images/audio. The backbone of Stable Diffusion and DALL·E.

Text-to-Image

Text-to-ImageGenerating images from text descriptions — the technology behind DALL·E, Midjourney, and Stable Diffusion.

Open Weight

Open WeightModel weights are publicly released but training data/code may not be. Enables fine-tuning but not full reproduction.

Family Tree

Successors (1)

Stable Diffusion XL

Related Research (2)

DDPM / DiffusionDiffusion

2020 · UC Berkeley

Showed that gradually adding noise to data and then learning to reverse the process could generate images rivaling GANs, with more stable training and…

CLIPTransformer

2021 · OpenAI

Trained a model to understand both images and text by learning which image-text pairs go together from 400 million internet examples. This created a s…

External Links

Research Paper Announcement

More from Stability AI

Stable Diffusion XL2023-07 · 6.6B

Stable Diffusion 32024-02 · —

NextStable Diffusion XL