GLM-4.7 is a next-generation general-purpose model optimized for coding, reasoning, and agentic workflows, delivering strong gains in multilingual software engineering, tool use, and complex problem solving. It introduces advanced thinking controls—interleaved, preserved, and turn-level thinking—to improve stability on long-horizon, multi-turn tasks.
Fine-tuningDocs | GLM-4.7 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model |
ServerlessDocs | Immediately run model on pre-configured GPUs and pay-per-token |
On-demand DeploymentDocs | On-demand deployments give you dedicated GPUs for GLM-4.7 using Fireworks' reliable, high-performance system with no rate limits. |
Run queries immediately, pay only for usage