Fireworks Blog

GLM 5.2 is live on Fireworks inference, day zero.

Case Studies
11/9/2025

Modernizing Healthcare with AI: How RADPAIR and Fireworks Unlock Smarter Radiology Workflows

Case Studies
11/3/2025

40X Faster, and Smarter Outputs: How Vercel Turbocharged their Code Fixing Model with Open Models, Speculative Decoding and Reinforcement Fine Tuning on Fireworks

Case Studies
10/31/2025

Genspark’s Deep Research Agent Outperforms a Frontier Closed Model in Quality and Tool Calls using Fireworks RFT, Achieving a 50% Cost Reduction

Series C
Company News
10/28/2025

We raised $250M To Help Enterprises Own Their AI

Model Releases
10/27/2025

Accelerate your Vision Pipelines with the new NVIDIA Nemotron Nano 2 VL Model on Fireworks AI

Developer Experience
10/23/2025

Deployment Shapes: One-Click Deployment Configured For You

Partner Announcements
10/20/2025

Fireworks and AMD partner to power the next gen of AI infrastructure on AMD Instinct™ GPUs

Developer Experience
10/15/2025

LLM on the edge: Model picking with Fireworks Eval Protocol + Ollama

Model Releases
10/9/2025

Announcing Embeddings and Reranking On Fireworks AI

Developer Experience
10/6/2025

Deep-Dive into LLM Fine-Tuning

Developer Experience
10/2/2025

Production-Ready AI Agents with Optimized Inference with AWS AgentCore

Company News
10/1/2025

Launching Fireworks for Startups Program!

Developer Experience
9/22/2025

Traces Are All You Need (to rank LLMs)

Developer Experience
9/12/2025

Understanding Embeddings and Reranking at Scale

Model Releases
8/26/2025

DeepSeek V3.1 now on Fireworks AI!

Developer Experience
8/25/2025

LLM Eval Driven Development with Claude Code

Benchmarks
8/15/2025

Your AI Benchmark is Lying to You. Here's How We Caught It

Developer Experience
8/14/2025

Test-Driven Agent Development with Eval Protocol

Developer Experience
8/12/2025

Quality first: how Fireworks.ai is the go-to place for gpt-oss

Model Releases
8/5/2025

Introducing OpenAI gpt-oss (20b & 120b)

Model Releases
8/4/2025

Announcing Eval Protocol