Compact 4-billion parameter text-to-image model from Black Forest Labs. Step-distilled to 4 inference steps for sub-second generation. Unifies text-to-image and image editing in a single architecture with Apache 2.0 licensing.
Default price per megapixel

Example output from FLUX.2 [klein] 4B
FLUX.2 [klein] 4B is a compact, Apache 2.0-licensed image generation model from Black Forest Labs. At 4 billion parameters and just 4 inference steps, it delivers sub-second generation without sacrificing quality. It uses a Mistral 3 VLM text embedder and supports up to 4 megapixel output resolution.
Rectified flow transformer with FLUX.2 VAE for pixel-to-latent conversion. The distilled variant uses `guidance_scale=1.0` by default. Requires ~13 GB VRAM (fits on RTX 3090/4070+).
Ideal for high-volume production workloads, real-time applications, consumer-facing products, and edge deployment where speed and commercial licensing matter.
Get API access in minutes. No GPU setup, no infrastructure to manage.