DeepSeek R1 0528, an updated version of the state-of-the-art DeepSeek R1 model, is now available. Try it now!

Quen Logo Mark

Qwen3 235B-A22B

Qwen3 is the latest evolution in the Qwen LLM series, featuring both dense and MoE models with major advancements in reasoning, agent capabilities, multilingual support, and instruction following. It uniquely allows seamless switching between “thinking” (for complex logic, math, coding) and “non-thinking” modes (for fast, general dialogue), delivering strong performance across tasks. Qwen3 outperforms previous Qwen models in math, code, and logical reasoning, while also offering superior human alignment for creative writing, roleplay, and multi-turn conversations. It supports over 100 languages and excels at tool integration for agent-based tasks. The flagship model, Qwen3-235B-A22B, has 235B parameters (22B active), 94 layers, and a native context length of 32K, extendable to 131K with YaRN.

Try Model

Fireworks Features

Serverless

Immediately run model on pre-configured GPUs and pay-per-token

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for Qwen3 235B-A22B using Fireworks' reliable, high-performance system with no rate limits.

Learn More

Info

Provider

Qwen

Model Type

LLM

Context Length

128K

Serverless

Available

Pricing Per 1M Tokens Input/Output

$0.22 / $0.88