Deploy your LoRA fine-tuned model with a single command that delivers performance matching the base model. This streamlined approach, called live merge, eliminates the previous two-step process and provides better performance compared to multi-LoRA deployments.

Quick deployment

Deploy your LoRA fine-tuned model with one simple command:

firectl create deployment "accounts/fireworks/models/<MODEL_ID of lora model>"

Your deployment will be ready to use once it completes, with performance that matches the base model.

Alternative deployment method

This two-step method is the standard approach for multi-LoRA deployments where multiple LoRA models share the same base model. While it can also be used for single LoRA deployments, it provides slower performance compared to live merge and is not recommended for single LoRA use cases.

You can also deploy single LoRA models using a two-step process:

1

Create base model deployment

Deploy the base model with addons enabled:

firectl create deployment "accounts/fireworks/models/<MODEL_ID of base model>" --enable-addons
2

Load LoRA addon

Once the deployment is ready, load the LoRA model onto the deployment:

firectl load-lora <MODEL_ID> --deployment <DEPLOYMENT_ID>

When to use live merge

Use live merge deployment when you:

  • Have a single fine-tuned model to serve
  • Need optimal performance that matches the base model
  • Want the simplest deployment process
  • Don’t require sharing a base model across multiple LoRA models

The live merge deployment method is designed for dedicated deployments with a single LoRA model. For multiple LoRA models sharing the same base model, consider using multi-LoRA deployment.