The definition of the deployment.

interface DeploymentEntity {
    asset?: WatsonXAI.ModelRel;
    base_model_id?: string;
    custom?: WatsonXAI.JsonObject;
    deployed_asset_type?: string;
    hardware_request?: WatsonXAI.HardwareRequest;
    hardware_spec?: WatsonXAI.HardwareSpec;
    online: WatsonXAI.OnlineDeployment;
    prompt_template?: WatsonXAI.SimpleRel;
    status?: WatsonXAI.DeploymentStatus;
    verbalizer?: string;
}

Properties

asset?: WatsonXAI.ModelRel

A reference to a resource.

base_model_id?: string

The base model required for this deployment when it serves a prompt template or a prompt tune of an IBM foundation model.

custom?: WatsonXAI.JsonObject

User-defined properties specified as key-value pairs.

deployed_asset_type?: string

The type of the deployed model. Possible values:

  1. prompt_tune - when a prompt-tuned model is deployed.

  2. foundation_model - when a prompt template is used on a pre-deployed, IBM-provided model.

  3. custom_foundation_model - when a custom foundation model is deployed.
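Because the field is typed as a plain string, a caller may want to narrow it to the three documented values before branching on it. The following is a minimal sketch of that idea; DeployedAssetType, isKnownAssetType, and describeAssetType are hypothetical helper names and are not part of the SDK.

```typescript
// Hypothetical local union of the three documented values (not an SDK type).
type DeployedAssetType =
  | "prompt_tune"
  | "foundation_model"
  | "custom_foundation_model";

// Descriptions paraphrased from the list above.
const ASSET_TYPE_DESCRIPTIONS: Record<DeployedAssetType, string> = {
  prompt_tune: "a prompt-tuned model is deployed",
  foundation_model: "a prompt template is used on a pre-deployed, IBM-provided model",
  custom_foundation_model: "a custom foundation model is deployed",
};

// Type guard: true only for one of the three documented values.
function isKnownAssetType(value: string | undefined): value is DeployedAssetType {
  return (
    value !== undefined &&
    Object.prototype.hasOwnProperty.call(ASSET_TYPE_DESCRIPTIONS, value)
  );
}

// Returns a description, or a fallback for an unset or undocumented value.
function describeAssetType(value: string | undefined): string {
  return isKnownAssetType(value)
    ? ASSET_TYPE_DESCRIPTIONS[value]
    : "unknown or unset";
}
```

Treating undocumented values as "unknown or unset" instead of throwing is a design choice of this sketch, made because the property is optional.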

hardware_request?: WatsonXAI.HardwareRequest

The requested hardware for deployment.

hardware_spec?: WatsonXAI.HardwareSpec

A hardware specification.

online: WatsonXAI.OnlineDeployment

Indicates that this is an online deployment. An object must be specified, but it can be empty. The serving_name can be provided in online.parameters.
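The two shapes described for online (an empty object, or one carrying a serving_name under parameters) can be sketched as follows. The interfaces here are simplified local stand-ins, assumed for illustration only and not the SDK's own WatsonXAI types; the serving name value is likewise made up.

```typescript
// Simplified stand-ins for the SDK types (assumptions, not the real definitions).
interface OnlineParametersSketch {
  serving_name?: string;
}

interface OnlineDeploymentSketch {
  parameters?: OnlineParametersSketch;
}

interface DeploymentEntitySketch {
  online: OnlineDeploymentSketch; // required, but may be an empty object
  base_model_id?: string;
}

// Smallest valid shape: `online` is present but empty.
const minimalEntity: DeploymentEntitySketch = { online: {} };

// Same entity with a serving name supplied through online.parameters.
const namedEntity: DeploymentEntitySketch = {
  online: { parameters: { serving_name: "my_serving_name" } },
};

// Reads the serving name if one was provided, otherwise undefined.
function servingNameOf(entity: DeploymentEntitySketch): string | undefined {
  return entity.online.parameters?.serving_name;
}
```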

prompt_template?: WatsonXAI.SimpleRel

A reference to a resource.

status?: WatsonXAI.DeploymentStatus

Specifies the current status, additional information about the deployment, and any failure messages if the deployment fails.

verbalizer?: string

The verbalizer that was used to train this model, if the deployment's deployed_asset_type is prompt_tune.
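Since verbalizer is only meaningful for prompt-tuned deployments, a reader of the entity might guard on deployed_asset_type before using it. A minimal sketch, assuming a reduced local interface; TunedDeploymentSketch and verbalizerFor are hypothetical names, and the verbalizer string is an illustrative value only.

```typescript
// Reduced local view of the two relevant fields (not the SDK interface).
interface TunedDeploymentSketch {
  deployed_asset_type?: string;
  verbalizer?: string;
}

// Returns the verbalizer only when the deployment is a prompt tune.
function verbalizerFor(entity: TunedDeploymentSketch): string | undefined {
  return entity.deployed_asset_type === "prompt_tune"
    ? entity.verbalizer
    : undefined;
}

// Illustrative entities; the verbalizer text is a made-up example.
const tuned: TunedDeploymentSketch = {
  deployed_asset_type: "prompt_tune",
  verbalizer: "Input: {{input}} Output:",
};
const templated: TunedDeploymentSketch = {
  deployed_asset_type: "foundation_model",
};
```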