Original author(s) | Runway, CompVis, and Stability AI |
---|---|
Developer(s) | Stability AI |
Initial release | August 22, 2022 |
Stable release | SD 3.5 (model)[1]
/ October 22, 2024 |
Repository | |
Written in | Python[2] |
Type | Text-to-image model |
License | Stability AI Community License |
Website | stability |
Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology is the premier product of Stability AI and is considered to be a part of the ongoing artificial intelligence boom.
It is primarily used to generate detailed images conditioned on text descriptions, though it can also be applied to other tasks such as inpainting, outpainting, and generating image-to-image translations guided by a text prompt.[3] Its development involved researchers from the CompVis Group at Ludwig Maximilian University of Munich and Runway with a computational donation from Stability and training data from non-profit organizations.[4][5][6][7]
Stable Diffusion is a latent diffusion model, a kind of deep generative artificial neural network. Its code and model weights have been released publicly,[8] and it can run on most consumer hardware equipped with a modest GPU with at least 4 GB VRAM. This marked a departure from previous proprietary text-to-image models such as DALL-E and Midjourney which were accessible only via cloud services.[9][10]
stable-diffusion-launch
was invoked but never defined (see the help page).stable-diffusion-github
was invoked but never defined (see the help page).verge
was invoked but never defined (see the help page).