GLM 5.2 is live! Opus-level intelligence at open-source rates. Pay per token on serverless. Try it today.

Model Library
/Deepseek/DeepSeek Coder 1.3B Base
Deepseek Logo Mark

DeepSeek Coder 1.3B Base

Ready
model path:accounts/fireworks/models/deepseek-coder-1b-base

DeepSeek Coder is composed of a series of code language models, each trained from scratch on 2T tokens, with a composition of 87% code and 13% natural language in both English and Chinese. deepseek-coder-1.3b-base is a 1.3B parameter model with Multi-Head Attention trained on 1 trillion tokens.

DeepSeek Coder 1.3B Base API Features

Fine-tuning

Docs

DeepSeek Coder 1.3B Base can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

On-demand Deployment

Docs

On-demand deployments allow you to use DeepSeek Coder 1.3B Base on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State
Ready
Created on
6/18/2024
Kind
Base model
Provider
Deepseek

Specification

Calibrated
No
Mixture-of-Experts
No
Parameters
1.34B

Supported Functionality

Fine-tuning
Supported
Serverless
Not supported
Context Length
16.3k tokens
Function Calling
Not supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported