Qwen3 30B A3B Thinking 2507

An updated FP8 version of Qwen3-30B-A3B in thinking mode, with improved tool use, coding, instruction following, logical reasoning, and text comprehension.

Fireworks Features

Fine-tuning

Qwen3 30B A3B Thinking 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model.

Serverless

Run the model immediately on pre-configured GPUs and pay per token.

On-demand Deployment

On-demand deployments give you dedicated GPUs for Qwen3 30B A3B Thinking 2507 using Fireworks' reliable, high-performance system with no rate limits.

Info & Pricing

Provider: Qwen

Model Type: LLM

Serverless: Available

Fine-Tuning: Available

Pricing Per 1M Tokens: $0.9
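
To make the per-token pricing concrete, here is a minimal sketch of the arithmetic. It assumes the listed $0.9 per 1M tokens applies as a single flat rate to all tokens in a request; the page does not say whether input and output tokens are priced differently. The function name `estimate_cost_usd` is hypothetical and not part of any Fireworks API.

```python
from decimal import Decimal

# Listed price per 1M tokens, taken from the pricing shown on this page.
PRICE_PER_MILLION_USD = Decimal("0.9")


def estimate_cost_usd(total_tokens: int) -> Decimal:
    """Estimate the serverless cost in USD for a given token count.

    Assumption: one flat rate covers every token (prompt and completion).
    """
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return PRICE_PER_MILLION_USD * total_tokens / Decimal(1_000_000)


if __name__ == "__main__":
    # Print rough estimates for a few example workloads.
    for tokens in (10_000, 1_000_000, 25_000_000):
        print(f"{tokens:>12,} tokens -> ${estimate_cost_usd(tokens):.4f}")
```

Using `Decimal` rather than floats keeps the currency arithmetic exact, which matters when small per-request costs are summed over many calls.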