voyage-4-lite is a lightweight, general-purpose embedding model optimized for low latency and cost. Enabled by Matryoshka learning and quantization-aware training, voyage-4-lite supports embeddings in 2048, 1024, 512, and 256 dimensions, with multiple quantization options.
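To make the dimension and quantization options above more concrete, here is a minimal sketch of the general idea behind Matryoshka-style embeddings: a prefix of the full vector can be kept and re-normalized to get a smaller embedding, and the result can be stored at lower precision. This is an illustration under stated assumptions, not how voyage-4-lite or any serving API produces its outputs. The helper names (`truncate_and_normalize`, `quantize_int8`), the symmetric int8 scheme, and the random stand-in vector are all hypothetical; in practice you would request the dimension and data type you want from the embedding service.

```python
import math
import random


def truncate_and_normalize(vec, dim):
    """Keep the first `dim` components and rescale to unit length.

    Assumption: the embedding was trained Matryoshka-style, so a prefix
    of the vector is itself a usable (smaller) embedding.
    """
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)
    return [x / norm for x in head]


def quantize_int8(vec):
    """Symmetric scalar quantization to integer codes in [-127, 127].

    Returns the codes and the scale needed to approximately recover
    the original floats (code * scale). Hypothetical scheme chosen for
    illustration only.
    """
    peak = max(abs(x) for x in vec)
    if peak == 0.0:
        return [0] * len(vec), 1.0
    scale = peak / 127.0
    return [int(round(x / scale)) for x in vec], scale


# Stand-in for a 2048-dimensional embedding; real vectors come from the model.
rng = random.Random(0)
full = [rng.gauss(0.0, 1.0) for _ in range(2048)]

small = truncate_and_normalize(full, 256)   # 2048 -> 256 dimensions
codes, scale = quantize_int8(small)         # float -> int8-style codes
restored = [c * scale for c in codes]       # approximate reconstruction
```

Smaller dimensions and lower-precision codes reduce storage and speed up similarity search, typically at some cost in retrieval quality; the right trade-off depends on the workload.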
On-demand Deployment: On-demand deployments let you run Voyage 4 Lite on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits.