DeepSeek R1 (Fast) API & Playground

DeepSeek R1 (Fast) is the speed-optimized serverless deployment of DeepSeek-R1. Compared to the DeepSeek R1 (Basic) endpoint, R1 (Fast) provides faster speeds with higher per-token prices, see https://fireworks.ai/pricing for details. Identical models are served on the two endpoints, so there are no quality or quantization differences. DeepSeek-R1 is a state-of-the-art large language model optimized with reinforcement learning and cold-start data for exceptional reasoning, math, and code performance. The model is identical to the one uploaded by DeepSeek on HuggingFace. Note that fine-tuning for this model is only available through contacting fireworks at https://fireworks.ai/company/contact-us.

Fireworks Features

Fine-tuning Docs	DeepSeek R1 (Fast) can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments give you dedicated GPUs for DeepSeek R1 (Fast) using Fireworks' reliable, high-performance system with no rate limits.

DeepSeek R1 FAQs

Metadata

State

Ready

Created on

1/20/2025

Kind

Base model

Provider

Deepseek

Hugging Face

DeepSeek-R1

Specification

Calibrated

Yes

Mixture-of-Experts

Yes

Parameters

671B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Serverless LoRA

Not supported

Context Length

163.8k tokens

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

DeepSeek R1 (Fast)

Fireworks Features

Fine-tuning

On-demand Deployment

DeepSeek R1 FAQs

What is DeepSeek R1 (Fast) and who developed it?

What applications and use cases does DeepSeek R1 excel at?

What is the maximum context length for DeepSeek R1?

Does DeepSeek R1 support quantized formats (4-bit/8-bit)?

What is the default temperature of DeepSeek R1 on Fireworks AI?

What is the maximum output length for DeepSeek R1?

What are known failure modes of DeepSeek R1?

How many parameters does DeepSeek R1 have?

Is fine-tuning supported for DeepSeek R1?

What license governs commercial use of DeepSeek R1?

Metadata

Specification

Supported Functionality