Try the latest GLM-4.6 with extended context, superior coding, and refined intelligence. Now available on-demand

Model Library
/Qwen/Qwen3 Embedding 8B
Quen Logo Mark

Qwen3 Embedding 8B

Ready
fireworks/qwen3-embedding-8b

    The Qwen3 Embedding 8B model is the latest proprietary model of the Qwen family, specifically designed for text embedding tasks. This model inherits the exceptional multilingual capabilities, long-text understanding, and reasoning skills building upon the dense foundational models of the Qwen3 series. The model represents significant advancements in multiple text embedding tasks including text retrieval, code retrieval, text classification, text clustering.

    Fireworks Features

    Serverless

    Docs

    Immediately run model on pre-configured GPUs and pay-per-token

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for Qwen3 Embedding 8B using Fireworks' reliable, high-performance system with no rate limits.

    0

    Metadata

    State
    Ready
    Created on
    8/20/2025
    Kind
    Kind 10
    Provider
    Qwen
    Hugging Face
    Qwen3-Embedding-8B

    Specification

    Calibrated
    No
    Mixture-of-Experts
    No
    Parameters
    8.2B

    Supported Functionality

    Fine-tuning
    Not supported
    Serverless
    Supported
    Serverless LoRA
    Not supported
    Context Length
    41k tokens
    Function Calling
    Not supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Not supported