Medium-sized reasoning model from Qwen.
QWQ 32B can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model
Learn MoreImmediately run model on pre-configured GPUs and pay-per-token
Learn MoreOn-demand deployments give you dedicated GPUs for QWQ 32B using Fireworks' reliable, high-performance system with no rate limits.
Learn MoreQwen
LLM
131072
Available
Available
$0.9