The model is served on Fireworks AI in beta mode, constrained generation & tool calls are not yet fully supported.
Fine-tuningDocs | Qwen3.5 397B A17B can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model |
On-demand DeploymentDocs | On-demand deployments allow you to use Qwen3.5 397B A17B on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |