Learn how to integrate and use Fireworks AI inference capabilities in Amazon SageMaker.
brew install jq
)brew install git-lfs
then git lfs install
)curl -LsSf https://astral.sh/uv/install.sh | sh
)[YOUR_REGION]
with your actual region. For best compatibility, instance types with A100, H100, or H200 GPUs are recommended (ml.p4d*
, ml.p5*
).
ml.p5*
for H100/H200).
fireworks.json
describing your model configuration:tar.gz
archive (SageMaker expects this format) and upload to S3:AmazonSageMakerFullAccess
and continue.SageMakerFireworksRole
) and create it.[BUCKET_NAME]
with your bucket name (no s3://
).S3FireworksModelAccess
) and create it.
env_setup.sh
script to set up your local environment and add FIREWORKS_METERING_KEY
to your environment before running the deployment scripts.p4
, p5
, or other GPU endpoint quotas. Request increases and retrydocker push
fails with permission denied/*
object path; Ensure that you are pointing your s3_model_path
to the model.tar.gz
file