Qwen3 30B-A3B
accounts/fireworks/models/qwen3-30b-a3b
Latest Qwen3 state of the art model, 30B with 3B active parameter model
Qwen3 30B-A3B is available via Fireworks' serverless API, where you pay per token. There are several ways to call the Fireworks API, including Fireworks' Python client, the REST API, or OpenAI's Python client.
On-demand deployments allow you to use Qwen3 30B-A3B on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.
Immediately run model on pre-configured GPUs and pay-per-token
accounts/fireworks/models/qwen3-30b-a3b
import requests import json url = "https://api.fireworks.ai/inference/v1/chat/completions" payload = { "model": "accounts/fireworks/models/qwen3-30b-a3b", "max_tokens": 5000, "top_p": 1, "top_k": 40, "presence_penalty": 0, "frequency_penalty": 0, "temperature": 0.6, "messages": [ { "role": "user", "content": "Hello, how are you?" } ] } headers = { "Accept": "application/json", "Content-Type": "application/json", "Authorization": "Bearer <API_KEY>" } requests.request("POST", url, headers=headers, data=json.dumps(payload))