Qwen3 VL 30B A3B Thinking API & Playground

Qwen3-VL series delivers superior text understanding & generation, deeper visual perception & reasoning, extended context length, enhanced spatial and video dynamics comprehension, and stronger agent interaction capabilities. Available in Dense and MoE architectures that scale from edge to cloud, with Instruct and reasoning‑enhanced Thinking editions.

Qwen3 VL 30B A3B Thinking API Features

Fine-tuning Docs	Qwen3 VL 30B A3B Thinking can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments allow you to use Qwen3 VL 30B A3B Thinking on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State

Ready

Created on

10/8/2025

Kind

Base model

Provider

Qwen

Hugging Face

Qwen/Qwen3-VL-30B-A3B-Thinking

Specification

Calibrated

Mixture-of-Experts

Yes

Parameters

31B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Context Length

262k tokens

Function Calling

Supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Supported

Qwen3 VL 30B A3B Thinking

Qwen3 VL 30B A3B Thinking API Features

Fine-tuning

On-demand Deployment

Metadata

Specification

Supported Functionality