

Factory is building agent-native software development. Its agents, called Droids, run across the full software development lifecycle: triage, planning, code generation, validation, release, and monitoring. The goal is a software factory that ingests continuous signals and ships production software, with human judgment applied only where it is truly required. Engineering teams at Adobe, Adyen, Chainguard, Clari, Nvidia, Writer and many more already build on it.
Two principles separate Factory from single-model coding tools.
"At Factory, our mission is to bring autonomy to software engineering: to move teams from manually writing code by hand to a world where agents, we call them Droids, fully automate engineering with a high quality bar and humans in the loop." - Leo Tchourakov, Member of Technical Staff at Factory
Leo Tchourakov is a Research Engineer at Factory, where he integrates and evaluates every new model the platform supports, testing each one for quality before it reaches customers.
Open-weight models have caught up. Leo observes that late last year they trailed the best frontier models by roughly nine months, and today they post benchmark results comparable to top frontier models from a quarter earlier, at a fraction of the cost. Adoption followed the capability curve, with the open-weight share of Factory usage rising 2-3x in six months. Two forces drive that shift.
What makes offering a choice of models hard is delivery. Every new model behaves differently behind a unified API, with its own reasoning and tracing formats, tool schemas, conversation handling, and quirks around opening PRs or working with unstaged git state. Factory has to absorb all of that and still make each model run reliably in one harness. Speed compounds the difficulty, because a model can go from announcement to heavy demand in a matter of hours, and Factory's users expect it live on Droid the day it ships. When a new model lands and it is not on Droid within the hour, the team hears about it on Twitter/X.
"It takes a lot to make these models available. AI progress is so rapid that the time from when we know a model is launching to when it actually launches is very short." - Leo Tchourakov
Factory trialed Fireworks against several inference providers before standardizing on it for open-weight models. Fireworks won on the criteria that matter for a strategy of offering the broadest possible model choice:
"Fireworks supports us by having these models available on basically day zero, typically well ahead of most other inference providers." - Leo Tchourakov
The headline outcome for Factory’s customers is cost efficiency. Routing a task to the right open model instead of a frontier default changes the economics, with a Kimi K2.7 model running a task at roughly 20% of frontier cost and a MiniMax model running it at roughly 6%. Factory Router, now in private preview, lowers cost 30-40% per average task by matching each task to the most cost-efficient model that clears the reliability bar. The routing data shows a roughly even split across customers, where a third of tasks are hard enough to warrant the latest frontier model, a third are routine enough for the most cost-efficient, and a third fall in between.

Throughput compounds the savings for customers. A ticket-triage automation processes about five times more work for the same spend on Kimi 2.6, and up to 15 times more on the most cost-efficient options like MiniMax. Quality holds up under that pressure, with Factory's harness consistently outscoring Claude Code on Terminal-Bench across the models it runs.
The effect shows up in the business. Model choice, delivered reliably on Fireworks, has become one of Factory's clearest differentiators in the enterprise market, and it has moved customers who were on the fence in using open models into active adoption.
"The choice of all the models has been one of the key differentiators helping Factory gain rapid adoption among enterprises. We've had almost 2x month-over-month growth for the last several months, and that speaks to how much having the choice of models really matters." - Leo Tchourakov
Factory Router is moving toward broad availability, extending automatic, cost-efficient model selection to every user without requiring expertise in each new release. The direction is set by economics, because agentic workflows scale token consumption, and open-weight models delivered on Fireworks are what keep the software-factory vision economically viable at enterprise scale. Large enterprises and small teams get the same outcome: more work automated, cost under control, and no dependence on a single vendor that can change the rules late in the game.
Fireworks turns that vision into something Factory can build on: day-zero access to every open model, delivered fast and reliably, with the GPU layer off Factory's plate.
"Open-weight models are going to be the main way engineering teams keep automating more while managing cost in a sustainable way. Fireworks is what enables us to deliver on that mission." - Leo Tchourakov
See how fast, reliable, day-zero access to open models can power your AI products. Sign up for Fireworks AI.