ERNIE-4.5-21B-A3B is a text MoE Post-trained model, with 21B total parameters and 3B activated parameters for each token.
On-demand DeploymentDocs | On-demand deployments allow you to use ERNIE-4.5-21B-A3B-PT on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |