Kimi K2 Instruct, a 1T parameter model with state of the art quality for coding, reasoning, and agentic tool use, is now available on Fireworks! Try now

Deepseek Logo Mark

DeepSeek R1 Distill Llama 8B

Llama 8B distilled with reasoning from Deepseek R1

Try Model

Fireworks Features

Fine-tuning

DeepSeek R1 Distill Llama 8B can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for DeepSeek R1 Distill Llama 8B using Fireworks' reliable, high-performance system with no rate limits.

Learn More

Info

Provider

Deepseek

Model Type

LLM

Context Length

131072

Fine-Tuning

Available

Pricing Per 1M Tokens

$0.2