Serverless 2.0 is live: control reliability & speed without reserved capacity. Get Started.

Customer Stories

Cursor

Cursor builds lightning fast code edits with Fireworks

Cursor’s Fast Apply feature lets developers instantly accept high-quality code suggestions with a single click. Powered by Fireworks’ speculative decoding, it delivers faster, more accurate edits—outperforming GPT-4 in both speed and usability.

Notion

Notion Reduces Latency 4x with Fireworks AI

Notion, the all-in-one workspace platform, partnered with Fireworks AI to fine-tune models, reducing latency from 2 seconds to 350 milliseconds. This enhancement enabled Notion to deliver faster, scalable AI features, supporting over 100 million users and aligning with their "vibe working" vision

Cursor

How Cresta delivered 100× more efficient agent guidance

Cresta, the AI platform for contact centers, uses Fireworks to power Knowledge Assist—real-time, context-aware guidance for agents by unifying information from multiple sources. With Fireworks’ scalable infrastructure and Multi-LoRA tech, Cresta cut costs by up to 100× versus GPT-4—boosting agent efficiency and customer satisfaction at scale.

Upwork

How Upwork and Fireworks deliver faster, smarter proposals for freelancers

Discover how Upwork leverages Fireworks' cutting-edge AI to deliver tailored, lightning-fast proposals, empowering freelancers to win more jobs and succeed in a competitive marketplace.

What our customers are saying

Cresta

"Fireworks' Multi-LoRA capabilities align with Cresta's strategy to deploy custom AI through fine-tuning cutting-edge base models. It helps unleash the potential of AI on private enterprise data."

Tim Shi
Tim Shi | Co-Founder at Cresta
Motif

“Using Fireworks AI on Foundry, we can run repeatable, high-volume evaluations through a single Azure endpoint, which helps our team move faster from deployment to informed model decisions with more confidence.”

Hanbin Jung | Partnership Lead at Motif
Cursor logo dark
why did Cursor rollout Composer 2 with @FireworksAI_HQ?

"...because it's way more performant than the open source engines and is what we use in production. our rl inference scales elastically and globally because of it. when we have low prod traffic we scale up RL, when we have high prod traffic, we scale down RL."

federico cassano
Federico Cassano | AI Researcher at Cursor
Vercel Dark

"Vercel’s v0 model is a composite model. The SOTA in this space changes every day, so you don’t want to tie yourself to a single model. Using a fine-tuned reinforcement learning model with Fireworks, we perform substantially better than SOTA. In our evaluation, Sonnet 3.5 compiled at 62%, and we got our error-free generation rate well into the 90s."

Malte Ubl, CTO at Vercel
Malte Ubl | CTO at Vercel
Notion logo dark

"By partnering with Fireworks to fine-tune models, we reduced latency from about 2 seconds to 350 milliseconds, significantly improving performance and enabling us to launch AI features at scale. That improvement is a game changer for delivering reliable, enterprise-scale AI"

Sarah Sachs
Sarah Sachs | AI Lead at Notion
genspark

"Fireworks enabled us to own our AI journey, and unlock better quality in just four weeks."

Kay Zhu
Kay Zhu | CTO at Genspark
Quora

"We've had a really great experience working with Fireworks to host open source models, including SDXL, Llama, and Mistral. After migrating one of our models, we noticed a 3x speedup in response time, which made our app feel much more responsive and boosted our engagement metrics."

SPENCER CHAN
Spencer Chan | Product Lead at Quora
Sourcegraph

"Fireworks has been a fantastic partner in building AI dev tools at Sourcegraph. Their fast, reliable model inference lets us focus on fine-tuning, AI-powered code search, and deep code context, making Cody the best AI coding assistant. They are responsive and ship at an amazing pace."

Beyang Liu Testimonial
Beyang Liu | CTO at Sourcegraph
Ui Path

By running Fireworks AI on Azure Foundry, UiPath powers both Autopilot and Delegate with open models that are significantly faster and more cost-efficient for Computer Use, all while matching the quality of Claude's Sonnet 4.6. It's a step-change in how we deliver AI at scale to our customers.

Neagovici-Negoescu
Mircea Neagovici-Negoescu | SVP, Head of AI at UiPath
Cursor logo dark

“Fireworks has been an amazing partner getting our Fast Apply and Copilot++ models running performantly. They exceeded other competitors we reviewed on performance. After testing their quantized model quality for our use cases, we have found minimal degradation. Fireworks helps implement task specific speed ups and new architectures, allowing us to achieve bleeding edge performance!”

Sualeh Asif Testimonial
Sualeh Asif | CPO at Cursor
genspark
"Fireworks enabled us to own our AI journey, and unlock better quality in just four weeks. This resulted in a better user experience for our customers."
Kay Zhu
Kay Zhu | CTO at Genspark
stackblitz

Fireworks AI on Microsoft Foundry gives us the inference throughput and latency we need to power Bolt at production scale and all within the Azure ecosystem.

Dominick Elm
Dominick Elm | Founding Engineer at StackBlitz (Bolt)
rLLM

"The rLLM team is dedicated to pushing the boundaries of autonomous AI, which means our time is best spent on innovation rather than managing backend clusters. The Fireworks Training SDK lets us focus on our research instead of wrestling with infrastructure. The platform is fast, well-optimized, and just works."

rLLM
Kyle Montgomery & Sijun Tan | Core Contributors, rLLM at rLLM
Cresta

"Fireworks' Multi-LoRA capabilities align with Cresta's strategy to deploy custom AI through fine-tuning cutting-edge base models. It helps unleash the potential of AI on private enterprise data."

Tim Shi
Tim Shi | Co-Founder at Cresta
Motif

“Using Fireworks AI on Foundry, we can run repeatable, high-volume evaluations through a single Azure endpoint, which helps our team move faster from deployment to informed model decisions with more confidence.”

Hanbin Jung | Partnership Lead at Motif
Cursor logo dark
why did Cursor rollout Composer 2 with @FireworksAI_HQ?

"...because it's way more performant than the open source engines and is what we use in production. our rl inference scales elastically and globally because of it. when we have low prod traffic we scale up RL, when we have high prod traffic, we scale down RL."

federico cassano
Federico Cassano | AI Researcher at Cursor
Vercel Dark

"Vercel’s v0 model is a composite model. The SOTA in this space changes every day, so you don’t want to tie yourself to a single model. Using a fine-tuned reinforcement learning model with Fireworks, we perform substantially better than SOTA. In our evaluation, Sonnet 3.5 compiled at 62%, and we got our error-free generation rate well into the 90s."

Malte Ubl, CTO at Vercel
Malte Ubl | CTO at Vercel