
Moonshot AI / Kimi K2 Instruct
fireworks/kimi-k2-instruct

    Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained with the Muon optimizer, Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities.

    Fireworks Features

    Fine-tuning


    Kimi K2 Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model.

    Serverless


    Immediately run the model on pre-configured GPUs and pay per token (see the request sketch after the pricing details below).

    On-demand Deployment


    On-demand deployments give you dedicated GPUs for Kimi K2 Instruct using Fireworks' reliable, high-performance system with no rate limits.

    Available Serverless

    Run queries immediately, pay only for usage

    $0.60 per 1M input tokens / $2.50 per 1M output tokens
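
    At these rates, a request with 1,000 input tokens and 500 output tokens costs roughly $0.0006 + $0.00125 ≈ $0.002. Below is a minimal sketch of a serverless request in Python, assuming the OpenAI-compatible chat completions endpoint at https://api.fireworks.ai/inference/v1 and an API key in the FIREWORKS_API_KEY environment variable; the full model identifier (accounts/fireworks/models/kimi-k2-instruct) is inferred from the fireworks/kimi-k2-instruct slug above, so verify both against the Fireworks docs.

        import os
        import requests

        # Assumed OpenAI-compatible serverless endpoint; verify against the Fireworks docs.
        URL = "https://api.fireworks.ai/inference/v1/chat/completions"

        payload = {
            # Model identifier inferred from the fireworks/kimi-k2-instruct slug above.
            "model": "accounts/fireworks/models/kimi-k2-instruct",
            "messages": [
                {"role": "user", "content": "Explain mixture-of-experts models in two sentences."}
            ],
            "max_tokens": 256,
            "temperature": 0.6,
        }

        headers = {
            "Authorization": f"Bearer {os.environ['FIREWORKS_API_KEY']}",
            "Content-Type": "application/json",
        }

        # Pay-per-token: you are billed only for the prompt and completion tokens used.
        response = requests.post(URL, json=payload, headers=headers, timeout=60)
        response.raise_for_status()
        print(response.json()["choices"][0]["message"]["content"])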

    Metadata

    State: Ready
    Created on: 7/11/2025
    Kind: Base model
    Provider: Moonshot AI
    Hugging Face: Kimi-K2-Instruct

    Specification

    Calibrated: No
    Mixture-of-Experts: Yes
    Parameters: 1T total (32B activated)

    Supported Functionality

    Fine-tuning: Supported
    Serverless: Supported
    Serverless LoRA: Not supported
    Context Length: 131.1k tokens
    Function Calling: Supported (see the request sketch below)
    Embeddings: Not supported
    Rerankers: Not supported
    Image Input: Not supported