The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters utilizing Grouped-query attention (GQA) for faster inference and Sliding window attention (SWA) to handle longer sequences at a lower cost.
On-demand deployments give you dedicated GPUs for Mistral 7B using Fireworks' reliable, high-performance system with no rate limits.
Learn MoreMistral
LLM
32768
$0.2