Yi-Large API & Playground

Yi-Large API Features

Fine-tuning Docs	Yi-Large can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments give you dedicated GPUs for Yi-Large using Fireworks' reliable, high-performance system with no rate limits.

Yi-Large FAQs

What is Yi-Large and who developed it?

Yi-Large is a 70B parameter dense language model developed by 01.AI. Yi-Large ranks among the top-performing open models on the LMSYS leaderboard, closely trailing GPT-4, Claude 3 Opus, and Gemini 1.5 Pro.

What applications and use cases does Yi-Large excel at?

Yi-Large is well-suited for:

Conversational AI
Code assistance
Agentic systems
Search
Enterprise RAG
Multilingual tasks (especially Spanish, Chinese, Japanese, German, and French)

What is the maximum context length for Yi-Large?

Yi-Large supports a context length of 32,800 tokens on Fireworks AI.

What is the usable context window for Yi-Large?

The maximum usable context window is 32.8K tokens, as defined by Fireworks AI's platform configuration.

How many parameters does Yi-Large have?

Yi-Large is a dense model with 70 billion parameters.

Is fine-tuning supported for Yi-Large?

Yes. Fireworks supports LoRA-based fine-tuning for this model.

What rate limits apply on the shared endpoint?

On Fireworks, on-demand deployments have no rate limits. Serverless access is not supported for this model.

Yi-Large

Yi-Large API Features

Fine-tuning

On-demand Deployment

Yi-Large FAQs

Metadata

Specification

Supported Functionality