Qwen2.5 32B Instruct API & Playground

Qwen2.5 are a series of decoder-only language models developed by Qwen team, Alibaba Cloud, available in 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B sizes, and base and instruct variants.

Qwen2.5 32B Instruct API Features

Fine-tuning Docs	Qwen2.5 32B Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments allow you to use Qwen2.5 32B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State

Ready

Created on

10/2/2024

Kind

Base model

Provider

Qwen

Hugging Face

Qwen/Qwen2.5-32B-Instruct

Specification

Calibrated

Mixture-of-Experts

Parameters

32.7B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Context Length

32.7k tokens

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

Qwen2.5 32B Instruct

Qwen2.5 32B Instruct API Features

Fine-tuning

On-demand Deployment

Metadata

Specification

Supported Functionality