Qwen3.5 27B API & Playground

Qwen3.5-27B (Qwen Chat) is a post-trained, chat-optimized large language model released in Hugging Face Transformers format. It’s designed for strong general-purpose performance across reasoning, coding, and agentic tasks, and is compatible with popular inference stacks like Transformers, vLLM, and SGLang. Qwen3.5 emphasizes improved efficiency and scalability, with broader multilingual coverage and training advances aimed at high-utility real-world deployment.

Qwen3.5 27B API Features

Fine-tuning Docs	Qwen3.5 27B can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments allow you to use Qwen3.5 27B on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State

Ready

Created on

3/2/2026

Kind

Base model

Provider

Qwen

Hugging Face

Qwen/Qwen3.5-27B

Specification

Calibrated

Mixture-of-Experts

Parameters

27.3B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Context Length

262k tokens

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

Qwen3.5 27B

Qwen3.5 27B API Features

Fine-tuning

On-demand Deployment

Metadata

Specification

Supported Functionality