GLM-4.7 is a next-generation general-purpose model optimized for coding, reasoning, and agentic workflows, delivering strong gains in multilingual software engineering, tool use, and complex problem solving. It introduces advanced thinking controls: interleaved, preserved, and turn-level thinking; to improve stability on long-horizon, multi-turn tasks. You can explore these thinking modes on our API using the `reasoning_history` field. Learn more here - https://docs.fireworks.ai/guides/reasoning
Fine-tuningDocs | GLM-4.7 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model |
On-demand DeploymentDocs | On-demand deployments allow you to use GLM-4.7 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |