voyage-multimodal-3.5 API & Playground

voyage-multimodal-3.5 is a high-accuracy multimodal embedding model for retrieval across text, images, PDFs, screenshots, tables, figures, slides, and videos. It is designed for production search and RAG workflows that need to retrieve visually rich and mixed-format content.

The model embeds text, visual documents, and video frames into a shared vector space, helping teams build retrieval systems where similarity reflects semantic meaning across modalities. It supports 2048, 1024, 512, and 256 output dimensions, along with multiple quantization options for efficient storage and retrieval.

voyage-multimodal-3.5 API Features

On-demand Deployment

Docs

On-demand deployments allow you to use voyage-multimodal-3.5 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

FAQs

Metadata

State

Ready

Created on

6/15/2026

Kind

Embedding model

Provider

Voyage AI by MongoDB

Specification

Calibrated

Mixture-of-Experts

Parameters

N/A

Supported Functionality

Fine-tuning

Not supported

Serverless

Not supported

Context Length

32.7k tokens

Function Calling

Not supported

Embeddings

Supported

Rerankers

Not supported

Support image input

Supported