New in Fireworks: Image-to-Image and ControlNet support for SSD-1B and SDXL!
By Fireworks.ai|11/2/2023
The Fireworks.ai blazing-fast inference platform enables developers to build with generative AI to accelerate product innovation.
We are thrilled to announce a set of comprehensive features available through our image generation APIs. These new features will empower generative AI developers to take their image-generation apps to the next level:
- Blazing-Fast Image Generation
- Image-to-Image Generation
- ControlNet Support
- Native Resolutions
- Safety Checker
- Cost-Effective Pricing
Try out the new features in the Fireworks console!
Try our API!
Blazing-Fast Image Generation
We are excited to introduce Segmind Stable Diffusion 1B (SSD-1B) to the Fireworks.ai inference platform. SSD-1B, released by Segmind, is one of the fastest diffusion-based text-to-image models available today. We are making it accessible to all our users via our fast inference platform with unprecedented image generation performance. It's now possible to generate 1024x1024 images in 30 steps in just 1 second!
Try out the SSD-1B model in the Fireworks console!
Image-to-Image
The Image-to-Image functionality can be used to transform photos with text prompts. To use it, provide an image description, a negative prompt (optional), and an initial image.
Here is a step-by-step example of how to use SDXL on the console:
Step 1: Use the following text prompt: “Fennec Fox Van Gogh, cartoon, purple, vibrant painting, fantasy concept”.
Step 2: Leave “Negative Prompt” empty for now and don't set any LoRA Adapter.
Step 3: Click on the image-to-image option under “Additional Conditioning” and upload the following image:
Step 4: Click on the “Generate Image” option. Here are a few generations obtained using the default parameters.
This is a lot of fun! Feel free to play with the console by creating an account, and learn more about image-to-image generation from our API reference.
ControlNet
The SSD-1B and SDXL models also now support using ControlNet to generate images using a provided image as guidance.
Let's use the same original image above to guide the generation of the SDXL example above. You can follow the same steps as before but now you set “Additional Conditioning” as “ControlNet” and upload the image. Here are a few of our own generations:
There is still more room for experimenting but you can see that the generated image is now much closer to the original image, all possible with the ControlNet support.
Learn more about ControlNet support for SDXL via our API reference.
Native Resolutions
The text-to-image feature now supports nine different resolutions of various aspect ratios: (1024, 1024), (1152, 896),
(896, 1152), (1216, 832), (832, 1216), (1344, 768), (768, 1344), (1536, 640), and (640, 1536). In addition, you can also
specify the return type to be either PNG
or JPEG
.
Safety Checker
It's fun to develop with these image generation models but it's important to also think about safety when developing with generative AI. To address this, we also now support enabling a safety check in the API for content filtering.
Enable safety_check: true in the API to run an unsafe content detection network on generated images. Unsafe images are blacked out and a CONTENT_FILTERED finish reason is returned.
To demonstrate how the safety check works, below is a sample code snippet that demonstrates how to use the
Fireworks.ai Python client to generate images with the
safety check enabled. Make sure to include your own FIREWORKS_API_KEY
and set safety_check=True
.
Try out the features in the Fireworks console!
Try our API!
Billing
We've also introduced our competitive cost-effective pricing for using the image-generation models. Check it out and hit us up with any questions.
Closing Words
We're amped to see how developers use these new features to accelerate product innovation and creative applications with the advanced image-generation capabilities of SSD-1B and SDXL on the Fireworks inference platform.
🗣 Join our community on Discord.
🎆 Learn more about the Fireworks Generative AI platform.