Stable Diffusion - Create Stunning Images on Your Windows GPU Server
Available in
AWS Marketplace
Available in
AWS Marketplace
Available in
AWS Marketplace
About
Stable Diffusion allows to render beautifully stunning images based on text or image input independently on your own AWS Windows cloud server with great performance.
Optimized to support large images with automatic upscaling. Uses GFPGAN to beautify faces.
Stable Diffusion creates images similar to Midjourney or OpenAI DALL-E.
Supports text2image as well as img2img to create impressive images based on other images with a guidance prompt controlling the influence on the generated image.
Leverages the Automatic Stable Diffusion bundle and GUI including built-in upscaling (ESRGAN, LDSR, ...), face restoration (GFPGAN, Codeformer, ...), inpainting, outpainting, textual inversion and many other powerful features as the most versatile Stable Diffusion integration.
Supported versions: Stable Diffusion 1.4 and 2.1.
Supports T4 GPUs with 16 GB of VRAM (g4dn family) and powerful A10 GPUs with 24 GB (g5 family).
Access the huge prompt and image library libraire.ai to get ideas for your own prompts and create new impressive images.
Uses DCV from AWS to offer high-end remote desktop. You can upload and download images created via the DCV interface.
This is a collaborative project of NI SP and AI SP.
More background on Stable Diffusion and license:
Stable Diffusion is a latent text-to-image diffusion model. Thanks to a generous compute donation from Stability AI and support from LAION, they were able to train a Latent Diffusion Model on 512x512 images from a subset of the LAION-5B database. Similar to Google's Imagen, this model uses a frozen CLIP ViT-L/14 text encoder to condition the model on text prompts. With its 860M UNet and 123M text encoder, the model is relatively lightweight and runs on a GPU with at least 10GB VRAM. See this section below and the model card. Stable Diffusion was trained on AWS GPU servers.
Stable Diffusion v1 refers to a specific configuration of the model architecture that uses a downsampling-factor 8 autoencoder with an 860M UNet and CLIP ViT-L/14 text encoder for the diffusion model. The model was pretrained on 256x256 images and then finetuned on 512x512 images.
Note: Stable Diffusion v1 is a general text-to-image diffusion model and therefore mirrors biases and (mis-)conceptions that are present in its training data. Details on the training procedure and data, as well as the intended use of the model can be found in the corresponding model card.
The weights are available via the CompVis organization at Hugging Face under a license which contains specific use-based restrictions to prevent misuse and harm as informed by the model card, but otherwise remains permissive. While commercial use is permitted under the terms of the license, we do not recommend using the provided weights for services or products without additional safety mechanisms and considerations, since there are known limitations and biases of the weights, and research on safe and ethical deployment of general text-to-image models is an ongoing effort. The weights are research artifacts and should be treated as such.
The CreativeML OpenRAIL M license is an Open RAIL M license, adapted from the work that BigScience and the RAIL Initiative are jointly carrying in the area of responsible AI licensing. See also the article about the BLOOM Open RAIL license on which our license is based.
Related Products
show moreBuyer guide
Read insights from real user interviews on why they chose this product.
How it works?
Search
Search 25000+ products and services vetted by AWS.
Request private offer
Our team will send you an offer link to view.
Purchase
Accept the offer in your AWS account, and start using the software.
Manage
All your transactions will be consolidated into one bill in AWS.
Create Your Marketplace with Webvar!
Launch your marketplace effortlessly with our solutions. Optimize sales processes and expand your reach with our platform.