
Gemma 4 31B IT NVFP4: a variant of Google's Gemma 4 31B Instruct quantized to NVIDIA's 4-bit NVFP4 format for efficient inference.
Fine-tuning | Gemma 4 31B IT NVFP4 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model. |
On-demand Deployment | On-demand deployments give you dedicated GPUs for Gemma 4 31B IT NVFP4 using Fireworks' reliable, high-performance system with no rate limits. |
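
To give a rough sense of why LoRA fine-tuning is considered efficient, the sketch below compares the trainable parameters of one full weight matrix with those of a rank-r LoRA update (two small matrices, B of shape d_out x r and A of shape r x d_in, added to the frozen weight). The function name and the layer dimensions and rank used here are hypothetical, chosen only for illustration; they are not taken from Gemma 4 or from Fireworks' training configuration.

```python
def lora_param_counts(d_out: int, d_in: int, rank: int) -> tuple[int, int]:
    """Return (full, lora) trainable parameter counts for one weight matrix.

    full: parameters updated when the whole d_out x d_in matrix is trained.
    lora: parameters in the low-rank factors B (d_out x rank) and A (rank x d_in).
    """
    full = d_out * d_in
    lora = rank * (d_out + d_in)
    return full, lora


# Hypothetical layer size and rank, for illustration only.
full, lora = lora_param_counts(d_out=4096, d_in=4096, rank=8)
print(f"full fine-tuning: {full} trainable parameters")
print(f"LoRA (rank 8):    {lora} trainable parameters")
print(f"reduction factor: {full // lora}x")
```

Under these assumed dimensions the low-rank update trains 256 times fewer parameters for that single matrix; the actual savings depend on the model's layer shapes, the rank chosen, and which layers are adapted.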