DeepSeek R1 0528, an updated version of the state-of-the-art DeepSeek R1 model, is now available. Try it now!

Customer Stories

Cursor

Cursor builds lightning fast code edits with Fireworks

Cursor’s Fast Apply feature lets developers instantly accept high-quality code suggestions with a single click. Powered by Fireworks’ speculative decoding, it delivers faster, more accurate edits—outperforming GPT-4 in both speed and usability.

Upwork

Upwork delivers faster, smarter proposals for freelancers

Upwork, the world’s largest freelance marketplace, built Uma to help freelancers craft better proposals faster. With Fireworks, Uma delivers real-time, personalized proposal generation—tailored to each freelancer’s skills and the job at hand—boosting match quality, efficiency, and success rates across the platform.

Sourcegraph

Sourcegraph increases completion speed and accuracy with Fireworks

Sourcegraph, the code intelligence platform uses Fireworks to power real-time, high-quality code completions. The result: 30% lower latency and 2.5× higher acceptance rates—driving faster workflows and a better developer experience across their enterprise products.

Cresta

How Cresta delivered 100× more efficient agent guidance

Cresta, the AI platform for contact centers, uses Fireworks to power Knowledge Assist—real-time, context-aware guidance for agents by unifying information from multiple sources. With Fireworks’ scalable infrastructure and Multi-LoRA tech, Cresta cut costs by up to 100× versus GPT-4—boosting agent efficiency and customer satisfaction at scale.

What our customers are saying

“Fireworks has been an amazing partner getting our Fast Apply and Copilot++ models running performantly. They were a cut above other competitors we tested on performance. We’ve done extensive testing on their quantized model quality for our use cases and have found minimal degradation. Additionally, Fireworks has been a key partner to help us implement task specific speed ups and new architectures, allowing us to achieve bleeding edge performance!”

"We've had a really great experience working with Fireworks to host open source models, including SDXL, Llama, and Mistral. After migrating one of our models, we noticed a 3x speedup in response time, which made our app feel much more responsive and boosted our engagement metrics."

"Fireworks has been a fantastic partner in building AI dev tools at Sourcegraph. Their fast, reliable model inference lets us focus on fine-tuning, AI-powered code search, and deep code context, making Cody the best AI coding assistant. They are responsive and ship at an amazing pace."