The Mistral-Nemo-Instruct-2407 Large Language Model (LLM) is the instruction-tuned version of Mistral-Nemo-Base-2407 and has the chat completions API enabled. Trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.
On-demand DeploymentDocs | On-demand deployments allow you to use Mistral Nemo Instruct 2407 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |