Model Library
Llama 3.3 70B Instruct
This updated version of Llama 70B achieves state-of-the-art results in reasoning, math, and instruction following. It offers similar performance to Llama 3.1 405B but with significantly better speed and lower cost.
Try It Learn more
Llama 3.1 405B Instruct
$3.00/M Tokens • 128k Context
Serverless

DeepSeek R1
$3.00/M Input • $8.00/M Output • 160k Context
Serverless

DeepSeek V3
$0.90/M Tokens • 128k Context
Serverless

Llama 3.1 8B Instruct
$0.20/M Tokens • 128k Context
ServerlessTunable

Deepseek R1 Distill Qwen 7B
Tunable