The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters utilizing Grouped-query attention (GQA) for faster inference and Sliding window attention (SWA) to handle longer sequences at a lower cost.
Mistral 7B can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
On-demand deployments give you dedicated GPUs for Mistral 7B using Fireworks' reliable, high-performance system with no rate limits.
Mistral
32768
Available
$0.2