Try the latest GLM-4.6 with extended context, superior coding, and refined intelligence. Now available on-demand

Quen Logo Mark

Qwen3 235B A22B Instruct 2507

Updated FP8 version of Qwen3-235B-A22B non-thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities

Fireworks Features

Fine-tuning

Qwen3 235B A22B Instruct 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

Learn More

Serverless

Immediately run model on pre-configured GPUs and pay-per-token

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for Qwen3 235B A22B Instruct 2507 using Fireworks' reliable, high-performance system with no rate limits.

Learn More

Qwen3 235B A22B Instruct 2507 FAQs

Info & Pricing

Provider

Qwen

Model Type

LLM

Context Length

262144

Serverless

Available

Fine-Tuning

Available

Pricing Per 1M Tokens Input/Output

$0.22 / $0.88