Model Library
Llama 4
Llama 4 Maverick provides SOTA intelligence and blazing fast speeds across multiple languages. Llama 4 Scout is a general purpose LLM equipped with multi-modality and 10M token context window to excel at use cases like multi-document summarization.
Try It Learn more
Llama 4 Maverick Instruct (Basic)
$0.22/M Input • $0.88/M Output • 1M Context
Serverless

Llama 4 Scout Instruct (Basic)
$0.15/M Input • $0.60/M Output • 128k Context
Serverless

Llama 3.1 405B Instruct
$3.00/M Tokens • 128k Context
ServerlessTunable

DeepSeek R1 (Fast)
$3.00/M Input • $8.00/M Output • 160k Context
ServerlessTunable

DeepSeek V3
$0.90/M Tokens • 128k Context
ServerlessTunable