Voyage Multimodal 3.5 is a next-generation multimodal embedding model built for retrieval over text, images, and videos. It embeds interleaved text and images (screenshots, PDFs, tables, figures, slides), and adds explicit support for video frames.
On-demand DeploymentDocs | On-demand deployments allow you to use Voyage Multimodal 3.5 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |