GLM 5.2 is live! Opus-level intelligence at open-source rates. Pay per token on serverless. Try it today.

Model Library
/Google/Gemma 7B Instruct
Google Ai Logo Mark

Gemma 7B Instruct

Ready
model path:accounts/fireworks/models/gemma-7b-it

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. They are text-to-text, decoder-only large language models, available in English, with open weights, pre-trained variants, and instruction-tuned variants. Gemma models are well-suited for a variety of text generation tasks, including question answering, summarization, and reasoning. Their relatively small size makes it possible to deploy them in environments with limited resources such as a laptop, desktop or your own cloud infrastructure, democratizing access to state of the art AI models and helping foster innovation for everyone.

Gemma 7B Instruct API Features

On-demand Deployment

Docs

On-demand deployments allow you to use Gemma 7B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State
Ready
Created on
2/22/2024
Kind
Base model
Provider
Google
Hugging Face
google/gemma-7b-it

Specification

Calibrated
No
Mixture-of-Experts
No
Parameters
7B

Supported Functionality

Fine-tuning
Not supported
Serverless
Not supported
Context Length
8.19k tokens
Function Calling
Not supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported