Meta's Llama 2 model family is a collection of pretrained and fine-tuned generative text models trained on 40% more data than Llama 1, with double the context length. This is the 7B base model, trained on 2 trillion tokens with a context length of 4096 tokens.
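As a rough illustration of what the 4096 context length means in practice, the sketch below computes how many tokens remain for generation once a prompt has been counted. It assumes the prompt and the generated text share one context window, and that the prompt's token count comes from the model's tokenizer. The helper name `max_new_tokens` is hypothetical and not part of any Fireworks or Meta API.

```python
# Context length stated above for Llama 2 7B.
CONTEXT_LENGTH = 4096


def max_new_tokens(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens can still be generated after the prompt.

    Hypothetical helper: assumes the prompt and the generated tokens
    share a single context window of `context_length` tokens.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # A prompt that already fills or exceeds the window leaves no room.
    return max(context_length - prompt_tokens, 0)
```

For example, a prompt of 3000 tokens would leave `max_new_tokens(3000)`, that is 1096 tokens, for the completion under these assumptions.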
On-demand deployments give you dedicated GPUs for Llama 2 7B using Fireworks' reliable, high-performance system with no rate limits.
Developer: Meta
Type: LLM
Context length: 4096
Price: $0.2