FLUX.1 [schnell] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions. The FP8 version uses reduced precision numerics for 2x faster inference. FLUX.1 [schnell] FP8 is deployed as a Flumina app. See for more details: https://huggingface.co/fireworks-ai/FLUX.1-schnell-fp8-flumina
ServerlessDocs | Immediately run model on pre-configured GPUs and pay-per-token |
On-demand DeploymentDocs | On-demand deployments give you dedicated GPUs for FLUX.1 [schnell] FP8 using Fireworks' reliable, high-performance system with no rate limits. |
Run queries immediately, pay only for usage