Mixtral MoE 8x7B Instruct (HF Version) is the original, FP16 version of Mixtral MoE 8x7B Instruct whose results should be consistent with the official Hugging Face implementation.
On-demand DeploymentDocs | On-demand deployments allow you to use Mixtral MoE 8x7B Instruct (HF version) on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |