Qwen2.5-VL is a multimodal large language model series developed by the Qwen team at Alibaba Cloud, available in 3B, 7B, 32B, and 72B sizes.
Fine-tuning: Qwen2.5-VL 72B Instruct can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model.
On-demand Deployment: On-demand deployments give you dedicated GPUs for Qwen2.5-VL 72B Instruct using Fireworks' reliable, high-performance system with no rate limits.
Qwen2.5-VL 72B Instruct is a multimodal instruction-tuned model developed by Qwen (Alibaba Group). It is the largest model in the Qwen2.5-VL series and supports vision-language tasks including image, video, and document understanding.
This model is optimized for:
Note: YaRN is not recommended for tasks requiring precise visual localization.
On Fireworks, the model supports the full 128K context window on on-demand deployments.
The model has 73.4 billion parameters.
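To put the parameter count in perspective, the rough sketch below estimates the memory needed to hold the weights alone at a few storage precisions. The bytes-per-parameter table and the `weight_memory_gb` helper are illustrative assumptions, not figures from Fireworks or Qwen, and the estimate ignores activations, the KV cache, and runtime overhead.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 73.4e9  # parameter count stated above

# Bytes per parameter for common storage precisions (illustrative assumption).
PRECISIONS = {"fp32": 4, "bf16": 2, "int8": 1, "int4": 0.5}

estimates = {
    name: weight_memory_gb(NUM_PARAMS, size) for name, size in PRECISIONS.items()
}

for name, gb in estimates.items():
    print(f"{name}: ~{gb:.1f} GB")
```

Actual serving requirements will be higher than these numbers, since long-context and multimodal inputs add substantial activation and cache memory on top of the weights.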
Yes. Fireworks supports LoRA-based fine-tuning on dedicated GPUs for this model.
The model is released under the Tongyi Qianwen license.