GLM 5.2 is live! Opus-level intelligence at open-source rates. Pay per token on serverless. Try it today.

Fireworks Blog

glm fireworks lockup

GLM 5.2 is live on Fireworks inference, day zero.

Building an open-source Browser Agent on Fireworks AI
Developer Experience
5/21/2025

Building an open-source Browser Agent on Fireworks AI

Agentic AI Systems
Developer Experience
5/19/2025

Agentic AI Systems

Supervised Fine-Tuning (SFT) with LoRA on Fireworks AI: Tutorial
Developer Experience
5/12/2025

Supervised Fine-Tuning (SFT) with LoRA on Fireworks AI: Tutorial

Qwen 3 on Fireworks AI
Model Releases
5/6/2025

Qwen 3 on Fireworks AI: Controllable Chain-of-Thought and Tool Calling at Frontier Scale

Llama 4 Maverick on Fireworks AI
Developer Experience
4/28/2025

Optimizing Llama 4 Maverick on Fireworks AI

RAG application using MongoDB Atlas and Fireworks AI
Developer Experience
4/9/2025

Building Enterprise-Scale RAG Systems with Fireworks AI and MongoDB Atlas

Fireworks AI Now Supports NVIDIA NIM Deployments for Blazing AI Inference
Model Releases
3/18/2025

Fireworks AI Now Supports NVIDIA NIM Deployments for Blazing AI Inference

Faster, more efficient DeepSeek on the Fireworks AI Developer Cloud
Model Releases
3/18/2025

Faster, more efficient DeepSeek on the Fireworks AI Developer Cloud

Fine-Tuning DeepSeek v3 & R1 to optimize quality, latency, & cost
Model Releases
3/12/2025

Fine-Tuning DeepSeek v3 & R1 to optimize quality, latency, & cost

Enabling Function Calling in DeepSeek v3: Bridging the Gap Between Text and Action
Model Releases
2/14/2025

Enabling Function Calling in DeepSeek v3: Bridging the Gap Between Text and Action

DeepSeek v3 and R1 Model Architecture: Why it's powerful and economical
Developer Experience
2/7/2025

DeepSeek v3 and R1 Model Architecture: Why it's powerful and economical

DeepSeek R1 Just Got Eyes with Fireworks AI Document Inlining
Model Releases
2/5/2025

DeepSeek R1 Just Got Eyes with Fireworks AI Document Inlining

From text to task: Constrained generation for structured extraction in R1
Developer Experience
2/1/2025

From text to task: Constrained generation for structured extraction in R1

Distillation with Reasoning: Can DeepSeek R1 Teach Better Than Humans?
Developer Experience
1/31/2025

Distillation with Reasoning: Can DeepSeek R1 Teach Better Than Humans?

Mistral Small 3 Now Available on Fireworks: Faster, Lighter, and More Efficient
Model Releases
1/30/2025

Mistral Small 3 Now Available on Fireworks: Faster, Lighter, and More Efficient

Beyond Supervised Fine Tuning: How Reinforcement Learning Empowers AI with Minimal Labels
Developer Experience
1/27/2025

Beyond Supervised Fine Tuning: How Reinforcement Learning Empowers AI with Minimal Labels

DeepSeek R1: All you need to know 🐳
Model Releases
1/24/2025

DeepSeek R1: All you need to know 🐳

Real-time, performant code assistance: How Sourcegraph scaled with Fireworks AI
Case Studies
1/22/2025

Real-time, performant code assistance: How Sourcegraph scaled with Fireworks AI

DeepSeek V3 just got vision capabilities!
Model Releases
12/18/2024

DeepSeek V3 just got vision capabilities!

20x faster Whisper than OpenAI - Fireworks audio transcribes 1 hour in 4 seconds
Model Releases
12/9/2024

20x faster Whisper than OpenAI - Fireworks audio transcribes 1 hour in 4 seconds

How Cresta drives millions of real-time, AI-powered contact center interactions with Fireworks
Case Studies
12/8/2024

How Cresta drives millions of real-time, AI-powered contact center interactions with Fireworks