GLM 5.2 is live! Opus-level intelligence at open-source rates. Pay per token on serverless. Try it today.

Model Library
/Qwen/Qwen3 Next 80B A3B Instruct
Quen Logo Mark

Qwen3 Next 80B A3B Instruct

Ready
model path:accounts/fireworks/models/qwen3-next-80b-a3b-instruct

Qwen3 Next 80B A3B Instruct is a state-of-the-art mixture-of-experts (MoE) language model with 3 billion activated parameters and 80 billion total parameters. It features a hybrid attention architecture for efficient processing and supports contexts up to 262K tokens. To ensure sufficient GPU memory capacity, we recommend deploying this model on 2 NVIDIA H200 or 4 NVIDIA H100 GPUs.

Qwen3 Next 80B A3B Instruct API Features

On-demand Deployment

Docs

On-demand deployments allow you to use Qwen3 Next 80B A3B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State
Ready
Created on
9/16/2025
Kind
Base model
Provider
Qwen

Specification

Calibrated
No
Mixture-of-Experts
No
Parameters
80B

Supported Functionality

Fine-tuning
Not supported
Serverless
Not supported
Context Length
N/A
Function Calling
Not supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported