Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained with the Muon optimizer, Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities.
Kimi K2 Instruct is available via Fireworks' serverless API, where you pay per token. There are several ways to call the Fireworks API, including Fireworks' Python client, the REST API, or OpenAI's Python client.
Learn MoreOn-demand deployments allow you to use Kimi K2 Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.
Learn MoreKimi K2 excels at tool calling and agentic use cases. Function calling for this model is available to try on Fireworks today.
Learn MoreMoonshot AI
Available
$1 / $3