

Llama 2 70B

Model path: accounts/fireworks/models/llama-v2-70b

Meta developed and publicly released the Llama 2 family of large language models (LLMs), a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. The fine-tuned models, called Llama-2-Chat, are optimized for dialogue use cases. According to Meta, Llama-2-Chat models outperform open-source chat models on most benchmarks Meta tested and, in Meta's human evaluations for helpfulness and safety, are on par with some popular closed-source models such as ChatGPT and PaLM.

Llama 2 70B API Features

On-demand Deployment


On-demand deployments let you run Llama 2 70B on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits.
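As a rough illustration of how a deployed copy of this model might be queried, the sketch below builds a plain text-completion request for the model path listed above. It is a sketch under assumptions, not a definitive integration: the endpoint URL assumes Fireworks' OpenAI-compatible completions route, the helper names `build_completion_request` and `send_completion` are hypothetical, and an on-demand deployment may need to be running (and may be addressed differently) before a request succeeds. Because this is a base model, the example sends a raw prompt rather than chat messages.

```python
import json
import urllib.request

# Model path as listed on this page.
MODEL_PATH = "accounts/fireworks/models/llama-v2-70b"
# Assumption: Fireworks' OpenAI-compatible completions endpoint.
API_URL = "https://api.fireworks.ai/inference/v1/completions"


def build_completion_request(prompt: str, max_tokens: int = 128) -> dict:
    """Build the JSON payload for a plain text completion (no chat template)."""
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    return {"model": MODEL_PATH, "prompt": prompt, "max_tokens": max_tokens}


def send_completion(payload: dict, api_key: str) -> dict:
    """POST the payload with a bearer token and return the decoded JSON response.

    Hypothetical helper: requires a valid API key and a reachable deployment.
    """
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only builds and prints the payload; call send_completion() yourself
    # once you have an API key and an active deployment.
    payload = build_completion_request("The capital of France is", max_tokens=16)
    print(json.dumps(payload, indent=2))
```

Separating payload construction from the network call keeps the request shape easy to inspect and test without credentials.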

Metadata

State: Ready
Created on: 1/3/2024
Kind: Base model
Provider: Meta

Specification

Calibrated: No
Mixture-of-Experts: No
Parameters: 0

Supported Functionality

Fine-tuning: Not supported
Serverless: Not supported
Context length: 4,096 tokens
Function calling: Not supported
Embeddings: Not supported
Rerankers: Not supported
Image input: Not supported