Meta's Llama 2 model family is a collection of pretrained and fine-tuned generative text models trained on 40% more data than Llama 1 with double the context length. This is the 7B Base version trained on 2 trillion tokens with a context length of 4096.
Llama 2 7B can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
Learn MoreOn-demand deployments give you dedicated GPUs for Llama 2 7B using Fireworks' reliable, high-performance system with no rate limits.
Learn MoreMeta
4096
Available
$0.2