Gemma 4 E4B is a pre-trained multimodal base model by Google DeepMind with 8B total / 4.5B effective parameters, a 128K context window, and support for text, image, audio, and video input across 140+ languages.
On-demand Deployment | On-demand deployments let you run Gemma 4 E4B on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits. |