Qwen3 VL 235B A22B Thinking is a state-of-the-art vision-language model with 22 billion activated parameters and 235 billion total parameters. It enables enhanced visual perception and reasoning, supporting contexts up to 256K tokens. To ensure sufficient GPU memory capacity, we recommend deploying this model on 8 NVIDIA H200 GPUs.
On-demand deployments give you dedicated GPUs for Qwen3 VL 235B A22B Thinking using Fireworks' reliable, high-performance system with no rate limits.
Qwen
$0.5