Quick deployment
Deploy your LoRA fine-tuned model with a single command:
Once the deployment completes, it is ready to use, with performance that matches the base model.
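The page does not say how this deployment reaches base-model performance. One plausible reading, assuming "live merge" means the low-rank LoRA update is folded into the base weights before serving (the usual meaning of merging a LoRA adapter), is sketched below with toy matrices. The names `W`, `B`, `A`, and `x` are illustrative only, the usual LoRA scaling factor is omitted for brevity, and nothing here is the platform's actual implementation.

```python
# Toy illustration of LoRA weight merging (an assumed mechanism, not platform code).

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def add(a, b):
    """Add two matrices of the same shape element by element."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

# Base weight W (2x2) and rank-1 LoRA factors B (2x1) and A (1x2).
W = [[1.0, 2.0], [3.0, 4.0]]
B = [[0.5], [-1.0]]
A = [[2.0, 0.0]]
x = [[1.0], [1.0]]  # input as a column vector

# Unmerged serving: the adapter path is computed next to the base path on every call.
unmerged = add(matmul(W, x), matmul(B, matmul(A, x)))

# Merged serving: fold B @ A into W once, then each call is a single matmul,
# the same amount of work as the base model alone.
W_merged = add(W, matmul(B, A))
merged = matmul(W_merged, x)

print(merged == unmerged)
```

Under this assumption both paths give the same output, but the merged path does no extra work per request, which would be consistent with the claim that performance matches the base model.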
Alternative deployment method
This two-step method is the standard approach for multi-LoRA deployments, where multiple LoRA models share the same base model. It also works for a single LoRA model, but it is slower than live merge and is not recommended for that case.
1. Create base model deployment
Deploy the base model with addons enabled:
2. Load LoRA addon
Once the deployment is ready, load the LoRA model onto the deployment:
Deployment with the Build SDK
You can also deploy your LoRA fine-tuned model using the Build SDK:
The id parameter can be any simple string; it does not need to follow the format "accounts/account_id/deployments/model_id".
When to use live merge
Use live merge deployment when you:
- Have a single fine-tuned model to serve
- Need optimal performance that matches the base model
- Want the simplest deployment process
- Don’t require sharing a base model across multiple LoRA models
The live merge deployment method is designed for dedicated deployments with a single LoRA model. For multiple LoRA models sharing the same base model, consider using multi-LoRA deployment.
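The guidance above reduces to one question: how many LoRA models share the base model? A minimal sketch of that rule follows. The function name `recommend_deployment` and the labels it returns are hypothetical and are not part of any SDK or CLI.

```python
def recommend_deployment(num_lora_models: int) -> str:
    """Pick a deployment method from the number of LoRA models on one base model.

    Hypothetical helper that only encodes the guidance in this section.
    """
    if num_lora_models < 1:
        raise ValueError("expected at least one LoRA model")
    if num_lora_models == 1:
        # Single fine-tuned model: simplest process, base-model performance.
        return "live-merge"
    # Several LoRA models sharing a base model: two-step multi-LoRA method.
    return "multi-lora"


print(recommend_deployment(1))
print(recommend_deployment(3))
```

The helper treats the count as the only input because that is the only criterion this section names. Real decisions may also weigh cost or traffic patterns, which the page does not cover.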