Kimi 2.7 Code API & Playground

Kimi K2.7 Code is a coding-focused agentic model built upon Kimi K2.6. With substantial improvements on real-world long-horizon coding tasks, it strengthens end-to-end task completion across complex software engineering workflows while improving token efficiency, reducing thinking-token usage by approximately 30% compared with Kimi K2.6.

Kimi K2.7 Code API Features

Fine-tuning Docs	Kimi K2.7 Code can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
Serverless Docs	Kimi K2.7 Code is available via Fireworks' serverless API, where you pay per token. There are several ways to call the Fireworks API, including Fireworks' Python client, the REST API, or OpenAI's Python client.
On-demand Deployment Docs	On-demand deployments allow you to use Kimi K2.7 Code on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Available Serverless

Run queries immediately, pay only for usage

$0.95 / $0.19 / $4.00

Per 1M Tokens (input/cached input/output)

Metadata

State

Ready

Created on

6/12/2026

Kind

Base model

Provider

Moonshot AI

Hugging Face

moonshotai/Kimi-K2.7-Code

Specification

Calibrated

Mixture-of-Experts

Yes

Parameters

1.02T

Supported Functionality

Fine-tuning

Supported

Serverless

Supported

Context Length

262k tokens

Function Calling

Supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Supported

Kimi K2.7 Code

Kimi K2.7 Code API Features

Fine-tuning

Serverless

On-demand Deployment

Available Serverless

Metadata

Specification

Supported Functionality