DeepSeek R1 0528, an updated version of the state-of-the-art DeepSeek R1 model, is now available. Try it now!

Llama 3.3 70B Instruct

Llama 3.3 70B Instruct is the December update of Llama 3.1 70B. The model improves upon Llama 3.1 70B (released July 2024) with advances in tool calling, multilingual text support, math and coding. The model achieves industry leading results in reasoning, math and instruction following and provides similar performance as 3.1 405B but with significant speed and cost improvements.

Try Model

Fireworks Features

Fine-tuning

Llama 3.3 70B Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

Learn More

Serverless

Immediately run model on pre-configured GPUs and pay-per-token

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for Llama 3.3 70B Instruct using Fireworks' reliable, high-performance system with no rate limits.

Learn More

Info

Provider

Model Type

LLM

Context Length

131072

Serverless

Available

Fine-Tuning

Available

Pricing Per 1M Tokens

$0.9

Llama 3.3 70B Instruct

Fireworks Features

Fine-tuning

Serverless

On-demand Deployment

Info

Provider

Model Type

Context Length

Serverless

Fine-Tuning

Pricing Per 1M Tokens

Pages

Company

Legal

Connect

Platform