Kimi K2 Instruct, a 1T parameter model with state of the art quality for coding, reasoning, and agentic tool use, is now available on Fireworks! Try now

mooshot

Kimi K2 Instruct

Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained with the Muon optimizer, Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities.

Try Model

Fireworks Features

Serverless

Kimi K2 Instruct is available via Fireworks' serverless API, where you pay per token. There are several ways to call the Fireworks API, including Fireworks' Python client, the REST API, or OpenAI's Python client.

Learn More

On-demand deployment

On-demand deployments allow you to use Kimi K2 Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Learn More

Function Calling

Kimi K2 excels at tool calling and agentic use cases. Function calling for this model is available to try on Fireworks today.

Learn More

Info

Provider

Moonshot AI

Model Type

LLM

Serverless

Available

Pricing Per 1M Tokens Input/Output

$1 / $3