Phi-3.5 Vision Instruct

Phi-3-Vision-128K-Instruct is a lightweight, state-of-the-art open multimodal model built on datasets that include synthetic data and filtered publicly available websites, with a focus on very high-quality, reasoning-dense data for both text and vision. The model belongs to the Phi-3 model family, and the multimodal version supports a context length of 128K tokens. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization, to ensure precise instruction adherence and robust safety measures.

Fireworks Features

On-demand Deployment

On-demand deployments give you dedicated GPUs for Phi-3.5 Vision Instruct using Fireworks' reliable, high-performance system with no rate limits.

Info

Provider: Microsoft

Model Type: LLM, Vision

Context Length: 32064

Pricing Per 1M Tokens: $0.2
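
As a rough illustration of how the listed price of $0.2 per 1M tokens translates into a bill, the sketch below estimates the cost for a given token count. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes one flat rate for all tokens; the page does not say whether input, output, or image tokens are billed differently.

```python
# Listed price from the Info section: $0.2 per 1M tokens.
PRICE_PER_MILLION_TOKENS_USD = 0.2


def estimate_cost_usd(total_tokens: int,
                      price_per_million: float = PRICE_PER_MILLION_TOKENS_USD) -> float:
    """Estimate cost in USD, assuming a single flat per-token rate.

    Assumption: every token is billed at the same rate. This is an
    illustrative helper, not part of any Fireworks SDK.
    """
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return total_tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    # Example: 250,000 tokens at the listed rate.
    print(f"${estimate_cost_usd(250_000):.4f}")
```

Under this flat-rate assumption, cost scales linearly with token count, so halving the tokens in a request halves the estimated cost.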