Fireworks RFT now available! Fine-tune open models that outperform frontier models. Try today

Model Library
/Z.ai/GLM-4.5V
z.ai

GLM-4.5V

Ready
fireworks/glm-4p5v

    GLM-4.5V is based on ZhipuAI’s next-generation flagship text foundation model GLM-4.5-Air (106B parameters, 12B active). It continues the technical approach of GLM-4.1V-Thinking, achieving SOTA performance among models of the same scale on 42 public vision-language benchmarks. It covers common tasks such as image, video, and document understanding, as well as GUI agent operations.

    GLM-4.5V API Features

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for GLM-4.5V using Fireworks' reliable, high-performance system with no rate limits.

    GLM-4.5V FAQs

    What is GLM-4.5V and who developed it?

    GLM-4.5V is a vision-language model developed by ZhipuAI (Z.ai). It is based on the GLM-4.5-Air architecture with 106B total parameters (12B active) and continues the GLM-4.1V technical lineage. It achieves state-of-the-art (SOTA) results across 42 public V+L benchmarks and supports image, video, document understanding, and GUI agent operations.

    What applications and use cases does GLM-4.5V excel at?

    GLM-4.5V is designed for real-world multimodal reasoning and excels at:

    • Scene and image understanding
    • Video segmentation and event recognition
    • GUI automation (e.g. screen reading, icon recognition)
    • Long document parsing and chart interpretation
    • Visual grounding (bounding boxes)
    • Bilingual multimodal tasks (Chinese/English)
    What is the maximum context length for GLM-4.5V?

    The model supports a maximum context length of 131.1k tokens.

    What are known failure modes of GLM-4.5V?

    Documented limitations include:

    • Occasional overthinking or repetition
    • Raw HTML output in frontend code without proper formatting
    • Minor perception issues (e.g. object counting, character identification)
    • Slightly weaker performance on pure text-only Q&A
    How many parameters does GLM-4.5V have?
    • Total Parameters: 108B
    • Active Parameters: 12B (Mixture-of-Experts model)
    Is fine-tuning supported for GLM-4.5V?

    No, fine-tuning is not supported for GLM-4.5V on Fireworks AI.

    What rate limits apply on the shared endpoint?

    GLM-4.5V is only available via on-demand deployment, which comes with no rate limits.

    What license governs commercial use of GLM-4.5V?

    GLM-4.5V is released under the MIT License, and commercial use is permitted.

    Metadata

    State
    Ready
    Created on
    8/11/2025
    Kind
    Base model
    Provider
    Z.ai
    Hugging Face
    GLM-4.5V

    Specification

    Calibrated
    No
    Mixture-of-Experts
    Yes
    Parameters
    108B

    Supported Functionality

    Fine-tuning
    Not supported
    Serverless
    Not supported
    Serverless LoRA
    Not supported
    Context Length
    131.1k tokens
    Function Calling
    Supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Supported