

Announcing custom models and on-demand H100s with 50%+ lower costs and latency than vLLM


By Ray Thai | 6/3/2024
