Skip to main content
Runflow

SDXL-Lightning

ByteDance lightning-fast SDXL distillation. 4-step text-to-image at near-real-time latency.

By ByteDance

Pricing: $0.003 per request

Overview

SDXL-Lightning by ByteDance is a lightning-fast SDXL distillation. 4-step text-to-image at near-real-time latency, suited to interactive UIs and rapid creative iteration.

Key capabilities

  • **$0.003 per image at 1024x1024**, fixed per-call pricing. About 333 runs per $1.
  • **Multi-region failover**, Runflow routes SDXL-Lightning across multiple regions. If one region has an outage, traffic moves to another automatically.
  • **4-step inference**, distilled for sub-second to low-second response on a single GPU.
  • **SDXL base**, full Stable Diffusion XL 1.0 compatibility.

SDXL family

| Model | Details |

|---|---|

| **SDXL-Lightning** (this model) | 4-step lightning-fast SDXL distillation |

| **SDXL base** | Stable Diffusion XL 1.0 reference model |

Technical specifications

| Spec | Details |

|---|---|

| Architecture | SDXL distilled with adversarial diffusion |

| Inference Steps | 4 (default), up to 12 |

| Inputs | prompt, negative_prompt, seed, width, height, guidance, steps |

| Output Format | JPEG |

Examples

  • a cyberpunk city street at night, neon signs glowing

Frequently asked questions

What is SDXL-Lightning?
SDXL-Lightning by ByteDance is a 4-step distillation of Stable Diffusion XL. Near-real-time text-to-image, suited to interactive UIs and rapid creative iteration.
How much does SDXL-Lightning cost on Runflow?
SDXL-Lightning is $0.003 per image at 1024x1024, fixed per-call pricing. About 333 runs per $1.
What's the latency of SDXL-Lightning?
The 4-step distillation is built for sub-second to low-second response on a single GPU. Typical runs return in around 1 to 2 seconds on Runflow's API.
Can I use SDXL-Lightning commercially?
Yes. Runs made through Runflow's API are licensed for commercial use, including embedding outputs in customer-facing products.
Do I need to manage GPUs to run SDXL-Lightning?
No GPU management is required. Runflow handles inference infrastructure across multiple regions and routes requests automatically. You hit a single REST endpoint, you get a result.

Related models

  • FLUX 1.1 [pro] Ultra, Ultra-resolution variant of FLUX 1.1 [pro] with 4MP+ outputs and richer detail. Drop-in upgrade when the standard FLUX 1.1 [pro] resolution is the bottleneck.
  • FLUX 1.1 [pro], Black Forest Labs' production-grade FLUX 1.1. State-of-the-art prompt adherence and photoreal quality at competitive per-image pricing.
  • GPT Image 2, GPT Image 2.0, OpenAI's latest image model, is capable of creating extremely detailed images with fine typography.
  • FLUX.2 [klein] 9B, Text-to-image generation with FLUX.2 [klein] 9B from Black Forest Labs. Enhanced realism, crisper text generation, and native editing capabilities.

Discoverable surfaces

Production-ready solutions

View all