Mistral AI /
Mistral 7B
accounts/fireworks/models/mistral-7b
LLM
The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters utilizing Grouped-query attention (GQA) for faster inference and Sliding window attention (SWA) to handle longer sequences at a lower cost.
On-demand deployments
On-demand deployments allow you to use Mistral 7B on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.
See the On-demand deployments guide for details.