The limits that may be set per request.

interface ConsumptionsLimit {
    call_time?: string;
    max_input_tokens?: number;
    max_output_tokens?: number;
}

Properties

call_time?: string

The hard limit on the call time for a request, if set.

max_input_tokens?: number

The hard limit on the number of input tokens for a request, if set. A value of zero will disable this feature.

max_output_tokens?: number

The hard limit on the number of output tokens for a request, if set. A value of zero will disable this feature.