The Qwen3-VL series delivers superior text understanding and generation, deeper visual perception and reasoning, extended context length, enhanced comprehension of spatial relationships and video dynamics, and stronger agent interaction capabilities. It is available in Dense and MoE architectures that scale from edge to cloud, with Instruct and reasoning-enhanced Thinking editions.
Fine-tuning (Docs) | Qwen3 VL 30B A3B Thinking can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model. |
On-demand Deployment (Docs) | On-demand deployments let you run Qwen3 VL 30B A3B Thinking on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits. |