PaddleOCR-VL-1.6 is a compact ~0.9B vision-language model for document parsing (OCR, tables, formulas, charts, seals); SOTA on OmniDocBench v1.6. Supports image input, 131K context.
On-demand DeploymentDocs | On-demand deployments allow you to use PaddleOCR VL 1.6 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |