Qwen 3.7 Plus is now available on Serverless, exclusively on Fireworks. Try it today.

Model Library
/Google/Gemma 2 9B Instruct
Google Ai Logo Mark

Gemma 2 9B Instruct

Ready
model path:accounts/fireworks/models/gemma2-9b-it

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. They are text-to-text, decoder-only large language models, available in English, with open weights, pre-trained variants, and instruction-tuned variants. Gemma models are well-suited for a variety of text generation tasks, including question answering, summarization, and reasoning. Gemma 2 9B Instruct is the instruction-tuned version of Gemma 2 9B and has the chat completions API enabled.

Gemma 2 9B Instruct API Features

On-demand Deployment

Docs

On-demand deployments allow you to use Gemma 2 9B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Metadata

State
Ready
Created on
6/28/2024
Kind
Base model
Provider
Google

Specification

Calibrated
No
Mixture-of-Experts
No
Parameters
10.1B

Supported Functionality

Fine-tuning
Not supported
Serverless
Not supported
Context Length
8.19k tokens
Function Calling
Not supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported