Fireworks AI raises $250M Series C to power the future of enterprise AI. Read more

Model Library
/Microsoft/Phi-3.5 Vision Instruct
Microsoft Logo Mark

Phi-3.5 Vision Instruct

Ready
fireworks/phi-3-vision-128k-instruct

    Phi-3-Vision-128K-Instruct is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision. The model belongs to the Phi-3 model family, and the multimodal version comes with 128K context length (in tokens) it can support. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.

    Fireworks Features

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for Phi-3.5 Vision Instruct using Fireworks' reliable, high-performance system with no rate limits.

    Metadata

    State
    Ready
    Created on
    5/29/2024
    Kind
    Base model
    Provider
    Microsoft
    Hugging Face
    Phi-3.5-vision-instruct

    Specification

    Calibrated
    No
    Mixture-of-Experts
    No
    Parameters
    4.2B

    Supported Functionality

    Fine-tuning
    Not supported
    Serverless
    Not supported
    Serverless LoRA
    Not supported
    Context Length
    32.1k tokens
    Function Calling
    Not supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Supported