The definition of the deployment.

interface DeploymentEntity {
    asset?: WatsonXAI.Rel;
    base_model_id?: string;
    custom?: WatsonXAI.JsonObject;
    deployed_asset_type?: string;
    hardware_spec?: WatsonXAI.HardwareSpec;
    online: WatsonXAI.OnlineDeployment;
    prompt_template?: WatsonXAI.SimpleRel;
    status?: WatsonXAI.DeploymentStatus;
    verbalizer?: string;
}

Properties

asset?: WatsonXAI.Rel

A reference to a resource.

base_model_id?: string

The base model that is required for this deployment if this is for a prompt template or a prompt tune for an IBM foundation model.

User defined properties specified as key-value pairs.

deployed_asset_type?: string

The type of the deployed model. The possible values are the following: 1. prompt_tune - when a prompt tuned model is deployed. 2. foundation_model - when a prompt template is used on a pre-deployed IBM provided model.

hardware_spec?: WatsonXAI.HardwareSpec

A hardware specification.

Indicates that this is an online deployment. An object has to be specified but can be empty. The serving_name can be provided in the online.parameters.

prompt_template?: WatsonXAI.SimpleRel

A reference to a resource.

Specifies the current status, additional information about the deployment and any failure messages in case of deployment failures.

verbalizer?: string

The verbalizer that was used to train this model if the deployment has deployed_asset_type of prompt_tune.