MiniMax M2.5 is built for state-of-the-art coding, agentic tool use, search, and office work, extensively trained with reinforcement learning across hundreds of thousands of real-world environments to plan like an architect and generalize across unfamiliar scaffolding and tools. It delivers significantly faster task completion, improved token efficiency, and exceptional cost-effectiveness, making it well-suited for production-scale agentic applications and complex, multi-step workflows. PRICING: $0.3 / $0.03 / $1.20 per million tokens (input / cached / output).
ServerlessDocs | Immediately run model on pre-configured GPUs and pay-per-token |
On-demand DeploymentDocs | On-demand deployments give you dedicated GPUs for MiniMax-M2.5 using Fireworks' reliable, high-performance system with no rate limits. |
Run queries immediately, pay only for usage