Collaboration happens naturally here. Big ideas grow best when we solve problems together.
Real Connections, Every Day
From daily standups to after-hours hangs, genuine teamwork drives everything we do.
Team Offsites That Energize
We bring people together beyond the screen — to connect, recharge, and build stronger teams.
Inside Our Creative Space
Our offices are designed for focus, flow, and spontaneous moments of inspiration.
Open roles
A career at FireworksAI offers the opportunity to work closely with some of the best minds within the scientific community and beyond. We’re looking for people from all backgrounds who want to make a real, positive impact on the world.
Instantly run popular and specialized models, including Llama3, Mixtral, and Stable Diffusion, optimized for peak latency, throughput, and context length. FireAttention, our custom CUDA kernel, serves models four times faster than vLLM without compromising quality.
Instantly run popular and specialized models, including Llama3, Mixtral, and Stable Diffusion, optimized for peak latency, throughput, and context length.