Fireworks AI raises $250M Series C to power the future of enterprise AI. Read more

Model Library
/Qwen/Qwen3 235B A22B Instruct 2507
Quen Logo Mark

Qwen3 235B A22B Instruct 2507

Ready
fireworks/qwen3-235b-a22b-instruct-2507

    Updated FP8 version of Qwen3-235B-A22B non-thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities

    Qwen3 235B A22B Instruct 2507 API Features

    Fine-tuning

    Docs

    Qwen3 235B A22B Instruct 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

    Serverless

    Docs

    Immediately run model on pre-configured GPUs and pay-per-token

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for Qwen3 235B A22B Instruct 2507 using Fireworks' reliable, high-performance system with no rate limits.

    Available Serverless

    Run queries immediately, pay only for usage

    $0.22 / $0.88
    Per 1M Tokens (input/output)

    Qwen3 235B A22B Instruct 2507 FAQs

    Metadata

    State
    Ready
    Created on
    7/21/2025
    Kind
    Base model
    Provider
    Qwen
    Hugging Face
    Qwen3-235B-A22B-Instruct-2507-FP8

    Specification

    Calibrated
    Yes
    Mixture-of-Experts
    Yes
    Parameters
    235.1B

    Supported Functionality

    Fine-tuning
    Supported
    Serverless
    Supported
    Serverless LoRA
    Not supported
    Context Length
    262.1k tokens
    Function Calling
    Supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Not supported