PREPARING
. A successfully prepared model will have the desired precision added
to the Precisions
list.
Precisions
field will indicate what precisions the model has been prepared for.
To use the quantized FP8 checkpoint, pass the --precision
flag: