Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Path Parameters
The Account Id
The Batch Inference Job Id
Query Parameters
The fields to be returned in the response. If empty or "*", all fields will be returned.
Response
A successful response.
The creation time of the batch inference job.
The email address of the user who initiated this batch inference job.
JobState represents the state an asynchronous job can be in.
JOB_STATE_UNSPECIFIED
, JOB_STATE_CREATING
, JOB_STATE_RUNNING
, JOB_STATE_COMPLETED
, JOB_STATE_FAILED
, JOB_STATE_CANCELLED
, JOB_STATE_DELETING
, JOB_STATE_WRITING_RESULTS
, JOB_STATE_VALIDATING
, JOB_STATE_DELETING_CLEANING_UP
, JOB_STATE_PENDING
, JOB_STATE_EXPIRED
, JOB_STATE_RE_QUEUEING
, JOB_STATE_CREATING_INPUT_DATASET
The name of the model to use for inference. This is required, except when continued_from_job_name is specified.
The name of the dataset used for inference. This is required, except when continued_from_job_name is specified.
The name of the dataset used for storing the results. This will also contain the error file.
Parameters controlling the inference process.
The update time for the batch inference job.
The precision with which the model should be served. If PRECISION_UNSPECIFIED, a default will be chosen based on the model.
PRECISION_UNSPECIFIED
, FP16
, FP8
, FP8_MM
, FP8_AR
, FP8_MM_KV_ATTN
, FP8_KV
, FP8_MM_V2
, FP8_V2
, FP8_MM_KV_ATTN_V2
, NF4
, FP4
, BF16
, FP4_BLOCKSCALED_MM
, FP4_MX_MOE
Job progress.
The resource name of the batch inference job that this job continues from. Used for lineage tracking to understand job continuation chains.