Fireworks AI raises $250M Series C to power the future of enterprise AI. Read more

Model Library
/Qwen/Qwen2.5-VL 32B Instruct
Quen Logo Mark

Qwen2.5-VL 32B Instruct

Ready
fireworks/qwen2p5-vl-32b-instruct

    Qwen2.5-VL is a multimodal large language model series developed by Qwen team, Alibaba Cloud, available in 3B, 7B, 32B, and 72B sizes

    Fireworks Features

    Fine-tuning

    Docs

    Qwen2.5-VL 32B Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

    Serverless

    Docs

    Immediately run model on pre-configured GPUs and pay-per-token

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for Qwen2.5-VL 32B Instruct using Fireworks' reliable, high-performance system with no rate limits.

    Available Serverless

    Run queries immediately, pay only for usage

    $0.90 / $0.90
    Per 1M Tokens (input/output)

    Metadata

    State
    Ready
    Created on
    3/31/2025
    Kind
    Base model
    Provider
    Qwen
    Hugging Face
    Qwen2.5-VL-32B-Instruct

    Specification

    Calibrated
    Yes
    Mixture-of-Experts
    No
    Parameters
    33.5B

    Supported Functionality

    Fine-tuning
    Supported
    Serverless
    Supported
    Serverless LoRA
    Supported
    Context Length
    128k tokens
    Function Calling
    Not supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Supported