Qwen3 VL 235B A22B Instruct is a state-of-the-art vision-language model with 22 billion activated parameters and 235 billion total parameters. It enables enhanced visual perception and instruction-following, supporting contexts up to 256K tokens. To ensure sufficient GPU memory capacity, we recommend deploying this model on 8 NVIDIA H200 GPUs.
ServerlessDocs | Immediately run model on pre-configured GPUs and pay-per-token |
On-demand DeploymentDocs | On-demand deployments give you dedicated GPUs for Qwen3 VL 235B A22B Instruct using Fireworks' reliable, high-performance system with no rate limits. |
Run queries immediately, pay only for usage