The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
Immediately run model on pre-configured GPUs and pay-per-token
Learn MoreOn-demand deployments give you dedicated GPUs for Llama 4 Maverick Instruct (Basic) using Fireworks' reliable, high-performance system with no rate limits.
Learn MoreMeta
1M
Available
$0.22 / $0.88