In practice
It powers Stable Diffusion, Midjourney, Sora. When you integrate image generation what matters is the trade-off between quality, speed (number of steps), and control. Costs are in GPU-seconds rather than tokens.
Related terms
Seen in the wild
5 entries mentioning it- HighFlux 1.0 (Black Forest Labs): 12B parameters, flow matching, the new open source SOTA
- MediumIdeogram 1.0: readable text in generated images, the historic gap closes
- HighStable Diffusion XL 0.9: dual-encoder and 1024x1024 resolution
- HighControlNet: structural control for Stable Diffusion without retraining
- LandmarkStable Diffusion: image generation goes open