Navigate:

All ReposStable Diffusion

~$STAB↑0.3%

Stable Diffusion: Latent text-to-image diffusion model

CLIP-conditioned latent diffusion model for text-to-image synthesis.

LIVE RANKINGS • 10:20 AM • STEADY

OVERALL

#143

AI & ML

#56

30 DAY RANKING TREND

ovr#143

·AI#56

STARS

72.6K

FORKS

10.6K

7D STARS

+214

7D FORKS

+28

Tags:

AI & ML

See Repo:

Learn more about Stable Diffusion

Stable Diffusion is a latent diffusion model for text-to-image synthesis. It operates by encoding images into a latent space using a downsampling-factor 8 autoencoder, then applying diffusion processes conditioned on text embeddings from a CLIP encoder. The architecture consists of an 860M parameter UNet and a 123M parameter text encoder, designed to run on GPUs with at least 10GB VRAM. The model was pretrained on 256x256 images and subsequently fine-tuned on 512x512 images, making it suitable for generating images from natural language descriptions.

Latent Space Diffusion

Operates in compressed latent space with 8x downsampling rather than pixel space, reducing computational cost by 64x per dimension. Maintains high image quality while enabling faster generation on consumer hardware compared to pixel-based diffusion models.

Consumer GPU Compatible

860M UNet and 123M text encoder architecture runs on GPUs with 10GB VRAM. Enables local deployment without cloud infrastructure or high-end datacenter hardware.

CLIP Text Conditioning

Uses frozen CLIP ViT-L/14 encoder for text-to-image conditioning with 123M parameters. Leverages pretrained vision-language representations for flexible natural language control without custom text encoder training.

from diffusers import StableDiffusionPipeline
import torch

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    torch_dtype=torch.float16
).to("cuda")

image = pipe("a photograph of an astronaut riding a horse").images[0]
image.save("astronaut.png")

See how people are using Stable Diffusion

Loading tweets...

Top in AI & ML

Trending Repos

Pi Mono

17,222#1

OpenClaw

233,443#2

Zvec

8,089#3

Claude Code

70,649#4

Heretic

9,761#5

See all →

LIVE RANKINGS • 10:20 AM • STEADY

OVERALL

#143

AI & ML

#56

30 DAY RANKING TREND

ovr#143

·AI#56

STARS

72.6K

FORKS

10.6K

7D STARS

+214

7D FORKS

+28

[ EXPLORE MORE ]

Related Repositories

Discover similar tools and frameworks used by developers

Stable Diffusion: Latent text-to-image diffusion model

Learn more about Stable Diffusion

What is Stable Diffusion for?

What makes Stable Diffusion different?

Latent Space Diffusion

Consumer GPU Compatible

CLIP Text Conditioning

Example code snippets

See how people are using Stable Diffusion

Top in AI & ML

Pi Mono

OpenClaw

Claude Code

Heretic

Rowboat

Trending Repos

Pi Mono

OpenClaw

Zvec

Claude Code

Heretic

Related Repositories

Summarize

Higgsfield

PyTorch

Wan2.2

Llama

Product

Company

Helpful Links