Updated FP8 version of Qwen3-30B-A3B in thinking mode, with better tool use, coding, instruction following, logical reasoning, and text comprehension capabilities.
Qwen3 30B A3B Thinking 2507 can be customized with your data to improve responses. Fireworks uses LoRA to train and deploy your personalized model efficiently.
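As a rough illustration of why LoRA is efficient, here is a minimal sketch of the general technique in plain Python. It is not Fireworks' implementation: the function names (`matmul`, `lora_merge`, `lora_param_count`) and the `alpha` scaling convention are assumptions chosen for illustration. LoRA keeps the base weight matrix `W` frozen and trains only two small matrices `B` (d x r) and `A` (r x k), so the adapted weight is `W + (alpha / r) * B @ A`.

```python
# Illustrative LoRA sketch; all names are hypothetical, not a Fireworks API.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_merge(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where r is the adapter rank (rows of A)."""
    r = len(a)
    scale = alpha / r
    delta = matmul(b, a)  # low-rank update with the same shape as W
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def lora_param_count(d, k, r):
    """Trainable parameters in a rank-r adapter for a d x k weight matrix."""
    return r * (d + k)


# Tiny example: a 2 x 2 frozen weight with a rank-1 adapter.
w = [[1.0, 2.0], [3.0, 4.0]]
a = [[1.0, 0.0]]        # r x k = 1 x 2
b = [[0.5], [0.0]]      # d x r = 2 x 1
merged = lora_merge(w, a, b, alpha=2.0)
```

For a single 4096 x 4096 weight matrix, a rank-8 adapter under this scheme trains `lora_param_count(4096, 4096, 8)` parameters instead of all 4096 * 4096, which is the source of the training and deployment savings.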
Run the model immediately on pre-configured GPUs and pay per token.
On-demand deployments give you dedicated GPUs for Qwen3 30B A3B Thinking 2507 using Fireworks' reliable, high-performance system with no rate limits.
Qwen
Available
Available
$0.9