In the last article, we introduced the techniques behind Stable Diffusion 3. This text-to-image model created by Stability AI is the strongest so far. No matter if you want to generate fantastical multi-subject scenes or high-precision landscape photographs, absolutely nothing is off-limits.
In the dead of night, Stability AI has released Stable Diffusion 3.0, employing the same DiT architecture as Sora, resulting in significantly improved visual quality, text rendering, and complex object comprehension.
Next comes a guide on how to use Stable Cascade’s training code and download its required model guide. Especially, it includes training scripts for various use cases, such as Image-to-text, ControlNet, LoRA, and image reconstruction.
In the last article, we briefly went through the basics of Stable Cascade. In part two we are going to provide you with a detailed guide to using Stable Cascade, including inference on the model and instructions for using the extended functionality.