Updated FP8 version of Qwen3-30B-A3B non-thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities
Qwen3 30B A3B Instruct 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
Learn MoreImmediately run model on pre-configured GPUs and pay-per-token
Learn MoreOn-demand deployments give you dedicated GPUs for Qwen3 30B A3B Instruct 2507 using Fireworks' reliable, high-performance system with no rate limits.
Learn MoreQwen
262144
Available
Available
$0.5