GLM 5.2 is live! Opus-level intelligence at open-source rates. Pay per token on serverless. Try it today.

Model Library
/Z.ai/GLM-4.7
model path:accounts/fireworks/models/glm-4p7

GLM-4.7 is a next-generation general-purpose model optimized for coding, reasoning, and agentic workflows, delivering strong gains in multilingual software engineering, tool use, and complex problem solving. It introduces advanced thinking controls: interleaved, preserved, and turn-level thinking; to improve stability on long-horizon, multi-turn tasks. You can explore these thinking modes on our API using the `reasoning_history` field. Learn more here - https://docs.fireworks.ai/guides/reasoning

GLM-4.7 API Features

Fine-tuning

Docs

GLM-4.7 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

On-demand Deployment

Docs

On-demand deployments allow you to use GLM-4.7 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State
Ready
Created on
12/22/2025
Kind
Base model
Provider
Z.ai
Hugging Face
zai-org/GLM-4.7

Specification

Calibrated
No
Mixture-of-Experts
Yes
Parameters
352B

Supported Functionality

Fine-tuning
Supported
Serverless
Not supported
Context Length
202k tokens
Function Calling
Supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported