Llama 3.1 8B Instruct API & Playground

The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models in 8B, 70B and 405B sizes. The Llama 3.1 instruction tuned text only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.

Llama 3.1 8B Instruct API Features

Fine-tuning Docs	Llama 3.1 8B Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand Deployment Docs	On-demand deployments allow you to use Llama 3.1 8B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Llama 3.1 8B Instruct FAQs

Metadata

State

Ready

Created on

7/23/2024

Kind

Base model

Provider

Specification

Calibrated

Yes

Mixture-of-Experts

Parameters

8.83B

Supported Functionality

Fine-tuning

Supported

Serverless

Not supported

Context Length

131k tokens

Function Calling

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

Llama 3.1 8B Instruct

Llama 3.1 8B Instruct API Features

Fine-tuning

On-demand Deployment

Llama 3.1 8B Instruct FAQs

Metadata

Specification

Supported Functionality