Llama 3.1 405B Instruct Long API & Playground

The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models in 8B, 70B and 405B sizes. The Llama 3.1 instruction tuned text only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks. 405B model is the most capable from the Llama 3.1 family.

Llama 3.1 405B Instruct Long API Features

On-demand Deployment

Docs

On-demand deployments allow you to use Llama 3.1 405B Instruct Long on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State

Ready

Created on

8/14/2024

Kind

Base model

Provider

Specification

Calibrated

Mixture-of-Experts

Parameters

Supported Functionality

Fine-tuning

Not supported

Serverless

Not supported

Context Length

N/A

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported