Qwen 3 4B Instruct 2507 API & Playground

Introducing Qwen3-4B-Instruct-2507, with improved instruction following, reasoning, coding, multilingual knowledge, user alignment, and 256K long-context understanding.

Qwen 3 4B Instruct 2507 API Features

Fine-tuning Docs	Qwen 3 4B Instruct 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments allow you to use Qwen 3 4B Instruct 2507 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State

Ready

Created on

10/16/2025

Kind

Base model

Provider

Qwen

Hugging Face

Qwen/Qwen3-4B-Instruct-2507

Specification

Calibrated

Mixture-of-Experts

Parameters

4.41B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Context Length

262k tokens

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

Qwen 3 4B Instruct 2507

Qwen 3 4B Instruct 2507 API Features

Fine-tuning

On-demand Deployment

Metadata

Specification

Supported Functionality