Latest Qwen3 thinking model, competitive against the best close source models in Jul 2025.
Fine-tuningDocs | Qwen3 235B A22B Thinking 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model |
ServerlessDocs | Immediately run model on pre-configured GPUs and pay-per-token |
On-demand DeploymentDocs | On-demand deployments give you dedicated GPUs for Qwen3 235B A22B Thinking 2507 using Fireworks' reliable, high-performance system with no rate limits. |
Run queries immediately, pay only for usage