The production
AI platform built for developers
Fireworks partners with the world's leading generative AI researchers to serve the best models, at the fastest speeds.
Companies of all sizes trust Fireworks to power their production AI use-cases
Models curated and optimized by Fireworks
Mixtral MoE 8x7B Instruct
Mistral MoE 8x7B Instruct v0.1 model with Sparse Mixture of Experts. Fine tuned for instruction following
FireFunction V1
Fireworks' open-source function calling model.
Llama 2 70B Chat
A fine-tuned version of Llama 2 70B, optimized for dialogue applications using Reinforcement Learning from Human Feedback (RLHF), and perform comparably to ChatGPT according to human evaluations.
Mistral 7B Instruct
The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is a instruct fine-tuned version of the Mistral-7B-v0.1 generative text model using a variety of publicly available conversation datasets.
The fastest and most uncompromising AI platform!
Industry Leading Performance
Independently benchmarked to have the top speed of all inference providers
Enterprise Scale Throughput
FireLLaVA: the first commercially permissive OSS LLaVA model
State-of-the-art Models
Use powerful models curated by Fireworks or our in-house trained multi-modal and function-calling models
Battle Tested for Reliability
Built for Developers
Our OpenAI-compatible API makes it easy to start building with Fireworks!

