Qwen QWQ 32B Preview API

Qwen QwQ model focuses on advancing AI reasoning, and showcases the power of open models to match closed frontier model performance.QwQ-32B-Preview is an experimental release, comparable to o1 and surpassing GPT-4o and Claude 3.5 Sonnet on analytical and reasoning abilities across GPQA, AIME, MATH-500 and LiveCodeBench benchmarks. Note: This model is served experimentally as a serverless model. If you're deploying in production, be aware that Fireworks may undeploy the model with short notice.

Qwen QWQ 32B Preview API Features

Fine-tuning Docs	Qwen QWQ 32B Preview can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments give you dedicated GPUs for Qwen QWQ 32B Preview using Fireworks' reliable, high-performance system with no rate limits.

Metadata

State

Ready

Created on

11/27/2024

Kind

Base model

Provider

Qwen

Hugging Face

QWQ-32B-Preview

Specification

Calibrated

Mixture-of-Experts

Parameters

32.8B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Serverless LoRA

Supported

Context Length

32.8k tokens

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

Qwen QWQ 32B Preview