Join the Fireworks Startups Program and unlock credits, expert support, and community to scale fast. Join here

Kimi K2 Instruct 0905

Kimi K2 0905 is an updated version of Kimi K2, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Kimi K2 0905 has improved coding abilities, a longer context window, and agentic tool use, and a longer (262K) context window.

Fireworks Features

Fine-tuning

Kimi K2 Instruct 0905 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

Learn More

Serverless

Immediately run model on pre-configured GPUs and pay-per-token

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for Kimi K2 Instruct 0905 using Fireworks' reliable, high-performance system with no rate limits.

Learn More

Kimi K2 Instruct 0905 FAQs

Info & Pricing

Model Type

LLM

Context Length

262144

Serverless

Available

Fine-Tuning

Available

Pricing Per 1M Tokens Input/Output

$0.6 / $2.5