CLI Reference¶

The nexus command-line interface provides tools for managing and validating Nexus packages.

Global Options¶

The nexus CLI provides the following global behavior:

Help: Use --help with any command to see detailed usage information
Exit Codes: Commands exit with code 0 on success and 1 on failure. nexus run benchmarks additionally exits with code 130 when interrupted by the user.

Getting Help¶

Command Help¶

Get help for any command:

nexus --help
nexus validate --help
nexus list --help
nexus get --help

Version Information¶

Check the installed version:

nexus --version

nexus validate¶

Validate the structure and configuration of a Nexus package directory.

Usage¶

nexus validate <package_path>

Arguments¶

package_path (required): Path to a Nexus package directory to validate

Examples¶

Validate a package in the current directory:

nexus validate .

Validate a package at a specific path:

nexus validate packages/my-package

nexus list¶

List various resources in Nexus packages.

Subcommands¶

nexus list packages¶

List all Nexus packages discovered in the packages directory.

Usage:

nexus list packages

Example:

nexus list packages

nexus list benchmark-packages¶

List all benchmark packages discovered across all Nexus packages.

Usage:

nexus list benchmark-packages

Example:

nexus list benchmark-packages

nexus list benchmark-experiments¶

List all benchmark experiments discovered across all Nexus packages.

Usage:

nexus list benchmark-experiments

Example:

nexus list benchmark-experiments

nexus get¶

Get specific information about Nexus packages.

Subcommands¶

nexus get benchmark-requirements¶

Get the list of benchmark requirement specifiers for a specific Nexus package.

Usage:

nexus get benchmark-requirements <package_name>

Arguments:

package_name (required): Name of the Nexus package

Example:

nexus get benchmark-requirements terratorch

nexus run¶

Execute benchmark instances.

Subcommands¶

nexus run benchmarks¶

Execute benchmarks from a GitHub Pull Request. This command identifies new or changed benchmark instances in a PR and optionally executes them using the ado CLI.

Usage:

nexus run benchmarks --pr <pr_url> [OPTIONS]

Warning

This command requires the GitHub CLI (gh) to be installed and configured in the command execution environment.

Required Options:

--pr <pr_url>: GitHub Pull Request URL (e.g., https://github.com/IBM/algorithm-nexus/pull/123)

Optional Flags:

--remote <path>: Execute operations on a remote Ray cluster using the specified configuration file. When provided, the command automatically installs the required benchmark packages in the Ray environment. See Running ado on remote Ray clusters for the remote context configuration format.
--context <path>: Path to the ADO context YAML file (samplestore context). See Working with Contexts for how to manage contexts.
--dry-run: List the detected benchmark instances without executing them.
--output-file <path>: Output file path for execution results. If not specified, results are printed to the screen.
-o, --output-format <format>: Structured output format, either 'json' or 'yaml'. Can be used with or without --output-file. Without --output-file, the formatted output is printed to the console. With --output-file, it overrides the format that would otherwise be inferred from the file extension (json by default).
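The interaction between --output-file and --output-format can be sketched as a small shell function. This only illustrates the precedence described above and is not the CLI's actual implementation: the function name resolve_output_format is hypothetical, and treating .yml like .yaml is an assumption.

```shell
# Hypothetical helper mirroring the documented precedence:
#   1. an explicit --output-format always wins,
#   2. with no format and no output file, output is human-readable,
#   3. otherwise the format is inferred from the file extension,
#      falling back to json.
resolve_output_format() {
    explicit_format="$1"   # value of --output-format, may be empty
    output_file="$2"       # value of --output-file, may be empty

    if [ -n "$explicit_format" ]; then
        printf '%s\n' "$explicit_format"
    elif [ -z "$output_file" ]; then
        printf 'human-readable\n'
    else
        case "$output_file" in
            *.yaml|*.yml) printf 'yaml\n' ;;   # .yml handling is an assumption
            *)            printf 'json\n' ;;
        esac
    fi
}

# Illustrative calls: (format, file)
resolve_output_format "" "results.yaml"
resolve_output_format "json" "results.yaml"
resolve_output_format "" ""
```

The second call shows the override case: an explicit format takes priority even when the file extension suggests otherwise.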

Behavior:

The command automatically:

Checks if the local repository is on the same commit as the PR
If not, checks out the PR code to a temporary directory
Analyzes the PR to find new or changed benchmark instances
Executes the benchmarks (unless --dry-run is specified)
Writes results to the output file, or prints them to the screen when --output-file is not specified
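The first step, comparing the local checkout with the PR, can be reproduced by hand. The sketch below assumes the comparison is between the PR head commit and the local HEAD; how the command performs this check internally is not documented here, and the function names are hypothetical. It uses the GitHub CLI's JSON output and git rev-parse.

```shell
# Print the head commit SHA of a pull request (requires gh to be configured).
pr_head_sha() {
    gh pr view "$1" --json headRefOid --jq '.headRefOid'
}

# Print the commit SHA currently checked out in the local repository.
local_head_sha() {
    git rev-parse HEAD
}

# Succeeds (exit 0) when the local checkout is on the PR head commit,
# returns 1 when it is not, and 2 when either SHA could not be read.
on_pr_commit() {
    pr_sha="$(pr_head_sha "$1")" || return 2
    local_sha="$(local_head_sha)" || return 2
    [ -n "$pr_sha" ] && [ "$pr_sha" = "$local_sha" ]
}
```

For example, calling on_pr_commit with the PR URL before nexus run benchmarks indicates whether the command will use the local checkout or a temporary one.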

Benchmark Instance Detection:

A benchmark instance is detected as changed if any file within its directory is modified in the PR. This includes:

Changes to space.yaml files
Changes to any other files in the benchmark_instances/<instance-name>/ directory
New benchmark instance directories (any new folder under benchmark_instances/)

The detection works for both:

Model-level instances: packages/<package>/models/<model>/benchmark_instances/<instance>/
Package-level instances: packages/<package>/benchmark_instances/<instance>/
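The detection rule can be approximated with a path pattern: any changed file under a benchmark_instances/<instance>/ directory marks that instance as changed. The following is a sketch of that rule only, not the command's implementation; the function names are hypothetical and the sample paths are illustrative.

```shell
# Map one changed file path to the benchmark instance directory containing it.
# Prints nothing for paths outside any benchmark_instances/<instance>/ directory.
instance_dir_for() {
    printf '%s\n' "$1" | sed -n -E \
        's#^(packages/[^/]+/(models/[^/]+/)?benchmark_instances/[^/]+)/.*#\1#p'
}

# Reduce a list of changed files (one per line on stdin) to the unique set
# of affected instance directories.
changed_instances() {
    while IFS= read -r path; do
        instance_dir_for "$path"
    done | LC_ALL=C sort -u
}

# Illustrative input: two files in the same model-level instance, one file in
# a package-level instance, and one file that is not part of any instance.
printf '%s\n' \
    'packages/terratorch/models/prithvi/benchmark_instances/flood-test/space.yaml' \
    'packages/terratorch/models/prithvi/benchmark_instances/flood-test/data/sample.csv' \
    'packages/terratorch/benchmark_instances/smoke-test/space.yaml' \
    'packages/terratorch/nexus.yaml' |
    changed_instances
```

The optional models/<model>/ group in the pattern is what lets one expression cover both the model-level and the package-level layouts.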

Note

When not running in remote mode (--remote not set), the benchmark instances are executed with ado in the local environment. It is the user's responsibility to ensure that the required benchmark packages are installed in the local Python environment. The benchmark packages for each Nexus package are listed in its nexus.yaml configuration.

Examples:

List benchmarks in a PR without executing (dry run):

nexus run benchmarks --pr https://github.com/IBM/algorithm-nexus/pull/123 --dry-run

Execute benchmarks locally:

nexus run benchmarks --pr https://github.com/IBM/algorithm-nexus/pull/123

Execute benchmarks on a remote Ray cluster:

nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --remote path/to/remote-context.yaml \
  --context path/to/ado-context.yaml

Execute benchmarks and save results to a file:

nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --output-file benchmark_results.json

Execute benchmarks and save results in YAML format:

nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --output-file results.yaml \
  --output-format yaml

Output Format:

By default, the command outputs results in a human-readable format showing the status, message, and IDs for each benchmark instance.

To get structured output (JSON or YAML), use the --output-format option:

# Print JSON to console
nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --output-format json

# Print YAML to console
nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --output-format yaml

The structured output (JSON) has the following format:

{
    "instances": [
        {
            "instance_path": "packages/<package>/models/<model>/benchmark_instances/test_benchmark",
            "status": "started",
            "message": "Successfully started on Ray cluster with job ID: raysubmit_snRVd4ZqTTKcaR3W | Space ID: space-a009d7-default",
            "space_id": "space-a009d7-default",
            "operation_id": "randomwalk-123456-default",
            "ray_job_id": "raysubmit_snRVd4ZqTTKcaR3W"
        }
    ]
}

Status Values:

success: Benchmark completed successfully (local execution)
started: Benchmark started on Ray cluster (remote execution)
failed: Benchmark execution failed
unknown: Status could not be determined

Execution Mode Comparison:

Field	Local Mode (`--remote` not set)	Remote Mode (`--remote` set)
`status`	`success` (on success), `failed` (on error), `unknown` (if undetermined)	`started` (on success), `failed` (on error), `unknown` (if undetermined)
`operation_id`	ADO operation ID	`null`
`ray_job_id`	`null`	Ray job ID
Notes	Operation completes locally	Job submitted to Ray cluster

When operation_id is null (in remote mode), users will need to inspect the Ray job logs to extract the ADO operation ID once execution completes.

In case of failure (either mode), the status will be failed and the message field will contain the reason for the failure.

Exit Codes:

0: All benchmarks executed successfully
1: One or more benchmarks failed or an error occurred
130: Interrupted by user (Ctrl+C)
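In scripts and CI jobs it can help to turn these exit codes into readable messages. The wrapper below is a minimal sketch built on the three documented codes; the function names are hypothetical, and any other code is simply reported as undocumented.

```shell
# Translate the documented exit codes of nexus run benchmarks into a message.
describe_exit_code() {
    case "$1" in
        0)   echo "all benchmarks executed successfully" ;;
        1)   echo "one or more benchmarks failed or an error occurred" ;;
        130) echo "interrupted by user (Ctrl+C)" ;;
        *)   echo "undocumented exit code: $1" ;;
    esac
}

# Run any command, report how it ended on stderr, and preserve its exit code.
run_and_report() {
    code=0
    "$@" || code=$?
    echo "$1 exited with $code: $(describe_exit_code "$code")" >&2
    return "$code"
}
```

A typical call would be run_and_report nexus run benchmarks --pr <pr_url>, after which the caller can still branch on the original exit code.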

Common Workflows¶

Validating a New Package¶

When creating a new Nexus package, validate its structure:

# Navigate to your package directory
cd packages/my-new-package

# Validate the package
nexus validate .
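To validate several packages in one pass, the same command can be run in a loop that relies on the documented exit codes (0 on success, 1 on failure). This is a sketch under the assumption that each package is a direct subdirectory of packages/; the function name and the NEXUS_CMD variable are illustrative and not part of the CLI.

```shell
# Validate every package directory under a root (default: packages/).
# NEXUS_CMD allows the command to be swapped out, for example in tests.
validate_all_packages() {
    root="${1:-packages}"
    failed=0
    for pkg in "$root"/*/; do
        [ -d "$pkg" ] || continue      # skip when the glob matches nothing
        if ! ${NEXUS_CMD:-nexus} validate "$pkg"; then
            echo "validation failed: $pkg" >&2
            failed=1
        fi
    done
    return "$failed"
}
```

Run from the repository root, validate_all_packages returns a non-zero status if any package fails validation, while still checking the remaining packages.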

Listing Available Resources¶

View all packages and their resources:

# List all packages
nexus list packages

# List all benchmark packages
nexus list benchmark-packages

# List all benchmark experiments
nexus list benchmark-experiments

Getting Package Information¶

Retrieve specific information about a package:

# Get benchmark requirements for a package
nexus get benchmark-requirements my-package

Running Benchmarks from a Pull Request¶

Execute benchmarks from a GitHub PR to validate changes:

# First, check what benchmarks would be executed (dry run)
nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --dry-run

# Execute benchmarks locally
nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123

# Execute benchmarks on a remote Ray cluster
nexus run benchmarks \
  --pr https://github.com/IBM/algorithm-nexus/pull/123 \
  --remote config/remote-context.yaml \
  --context config/ado-context.yaml \
  --output-file pr123_results.json

The command will:

Automatically detect if your local repo is on the PR commit
Check out the PR code if needed (to a temporary directory)
Find all new or changed benchmark instances
Execute them and report results

Exit Codes¶

All commands follow standard Unix exit code conventions:

0: Success
1: Error (validation failed, resource not found, etc.)
130: Interrupted by user (Ctrl+C), as reported by nexus run benchmarks
