GLM 5.2 is live! Opus-level intelligence at open-source rates. Pay per token on serverless. Try it today.

Model Library
/Qwen/Qwen2.5 32B Instruct
Quen Logo Mark

Qwen2.5 32B Instruct

Ready
model path:accounts/fireworks/models/qwen2p5-32b-instruct

Qwen2.5 are a series of decoder-only language models developed by Qwen team, Alibaba Cloud, available in 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B sizes, and base and instruct variants.

Qwen2.5 32B Instruct API Features

Fine-tuning

Docs

Qwen2.5 32B Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

On-demand Deployment

Docs

On-demand deployments allow you to use Qwen2.5 32B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State
Ready
Created on
10/2/2024
Kind
Base model
Provider
Qwen

Specification

Calibrated
No
Mixture-of-Experts
No
Parameters
32.7B

Supported Functionality

Fine-tuning
Supported
Serverless
Not supported
Context Length
32.7k tokens
Function Calling
Not supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported