Qwen3 30B A3B Thinking 2507 API & Playground

Updated FP8 version of Qwen3-30B-A3B thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities

Qwen3 30B A3B Thinking 2507 API Features

Fine-tuning Docs	Qwen3 30B A3B Thinking 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments allow you to use Qwen3 30B A3B Thinking 2507 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State

Ready

Created on

7/30/2025

Kind

Base model

Provider

Qwen

Hugging Face

Qwen/Qwen3-30B-A3B-Thinking-2507-FP8

Specification

Calibrated

Yes

Mixture-of-Experts

Parameters

30.5B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Context Length

262k tokens

Function Calling

Supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

Qwen3 30B A3B Thinking 2507

Qwen3 30B A3B Thinking 2507 API Features

Fine-tuning

On-demand Deployment

Metadata

Specification

Supported Functionality