Join the Fireworks Startups Program and unlock credits, expert support, and community to scale fast. Join here

OpenAi Logo MArk

OpenAI gpt-oss-20b

Welcome to the gpt-oss series, OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-20b is used for lower latency, and local or specialized use-cases.

Try Model

Fireworks Features

Fine-tuning

OpenAI gpt-oss-20b can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

Learn More

Serverless

Immediately run model on pre-configured GPUs and pay-per-token

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for OpenAI gpt-oss-20b using Fireworks' reliable, high-performance system with no rate limits.

Learn More

gpt-oss-20b FAQs

Info & Pricing

Provider

OpenAI

Model Type

LLM

Context Length

131072

Serverless

Available

Fine-Tuning

Available

Pricing Per 1M Tokens Input/Output

$0.07 / $0.3