Learn how to connect to and query dedicated deployments that were created outside the SDK
firectl
or the Fireworks web UI, you can easily connect to them using the Build SDK to run inference. This is particularly useful when you want to leverage existing infrastructure or when deployments are managed by different teams.
LLM
instance with the deployment_type="on-demand"
and provide the deployment id
:
.apply()
- the deployment is already running and ready to serve requests..apply()
call needed - The deployment is already activeid
parameter should match exactly with your existing deployment:
model
parameter, it should match the model that your deployment is actually running:
Deployment not found error
Model mismatch warnings
model
parameter to match the deployed modelAuthentication errors
FIREWORKS_API_KEY
environment variable is set correctly