
GLM-4.7-Flash is a 30B-A3B Mixture-of-Experts (MoE) model: roughly 30B total parameters, of which about 3B are active per token. Positioned as the strongest model in the 30B class, it offers a new option for lightweight deployment that balances performance and efficiency.
On-demand Deployment | On-demand deployments let you run GLM-4.7-Flash on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits. |