Cloud Pak for Data Watson Studio Job Information
Monitor Summary
This monitor will generate the following observations:
- Number of Cloud Pak for Data Watson Studio Jobs across all projects
- Number of Cloud Pak for Data Watson Studio Active Jobs across all projects
- Number of Cloud Pak for Data Watson Studio Jobs per project
- Number of Cloud Pak for Data Watson Studio Active Jobs per project
- Run Status of Last Cloud Pak for Data Watson Studio Job
Monitor Requirements
In order for Watson Studio Job Information to be available, CCS (Common Core Services) must be available on the Cloud Pak for Data instance. Catridges like Watson Studio and Watson Machine Learning automatically install CCS as part of their dependencies. If CCS is not available, no watson studio job information is available.
Administrative Credential Requirements
In order to retrieve the monitor data, the credential of admin user will be used. The default password of admin user is saved in the secret admin-user-details. You could use the below command to query the default password of the admin user..
oc extract secret/admin-user-details --keys=initial_admin_password --to=-
Once the password of the admin user is changed, please also change the password in the secret admin-user-details.
oc set data secret/admin-user-details initial_admin_password=<new password>
Deployment of the monitor
This monitor can be deployed using the IBM Cloud Pak Deployer or manually deployed using oc commands:
Monitor Source
Source folder containing the script can be found here (cp4d-watsonstudio-job-info).
Generated observations
Once the monitor is deployed, the following observations are available in IBM Cloud Pak for Data Metrics:
Using the IBM Cloud Pak for Data Platform Management Events:
https://<CP4D-BASE-URL>/zen/#/platformManagement/events
On the Platform Management Events page the following entries are added:
Event | Event Type | Possible values | Description |
---|---|---|---|
Cloud Pak for Data Watson Studio Jobs Overall Count | Number of Cloud Pak for Data Watson Studio Jobs across all projects | info | The Cloud Pak for Data Platform Projects and Jobs information are requested. |
Cloud Pak for Data Watson Studio Active Jobs Overall Count | Number of Cloud Pak for Data Watson Studio Active Jobs across all projects | info | The Cloud Pak for Data Platform Projects and Jobs information are requested. |
Cloud Pak for Data Watson Studio Jobs Project Count | Number of Cloud Pak for Data Watson Studio Jobs per project | info | The Cloud Pak for Data Platform Projects and Jobs information are requested. |
Cloud Pak for Data Watson Studio Active Jobs Project Count | Number of Cloud Pak for Data Watson Studio Active Jobs per project | info | The Cloud Pak for Data Platform Projects and Jobs information are requested. |
Cloud Pak for Data Watson Studio Job Last Status | Run Status of Last Cloud Pak for Data Watson Studio Job | info | The Cloud Pak for Data Platform Projects, Jobs, and Job Run information are requested. |
Using the IBM Cloud Pak for Data Prometheus endpoint
https://<CP4D-BASE-URL>/zen/metrics
It will generate 2 types of metrics:
- Actual Result metrics
These metrics contain the observed values - Zen Watchdog metrics
These metrics are used by the Cloud Pak for Data Zen Watchdog to monitor the state and trigger notifications
Actual Result metrics:
- cp4d_watsonstudio_jobs_overall_count Provides number of watson studio jobs across all projects
- cp4d_watsonstudio_active_jobs_overall_count Provides number of watson studio active jobs across all projects
- cp4d_watsonstudio_jobs_project_count Provides number of watson studio jobs per project
- cp4d_watsonstudio_active_jobs_project_count Provides number of watson studio active jobs per project
- cp4d_watsonstudio_job_last_status Provides run status of the last watson studio job.
# HELP cp4d_watsonstudio_active_jobs_overall_count# TYPE cp4d_watsonstudio_active_jobs_overall_count gaugecp4d_watsonstudio_active_jobs_overall_count{event_type="cp4d_watsonstudio_active_jobs_overall_count",monitor_type="cp4dplatformwatsonstudiojobinfo",reference="All Watson Studio Projects"} 0# HELP cp4d_watsonstudio_active_jobs_project_count# TYPE cp4d_watsonstudio_active_jobs_project_count gaugecp4d_watsonstudio_active_jobs_project_count{event_type="cp4d_watsonstudio_active_jobs_project_count",monitor_type="cp4dplatformwatsonstudiojobinfo",reference="test-project"} 0# HELP cp4d_watsonstudio_job_last_start_epoch_time# TYPE cp4d_watsonstudio_job_last_start_epoch_time gaugecp4d_watsonstudio_job_last_start_epoch_time{event_type="cp4d_watsonstudio_job_last_status",monitor_type="cp4dplatformwatsonstudiojobinfo",reference="test-project_one-time-job-1"} 1.652924515e+09
Zen Watchdog metrics (used in platform management events)
- watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_jobs_overall_count
- watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_overall_count
- watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_jobs_project_count
- watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_project_count
- watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_job_last_status
Zen Watchdog metrics can have the following values:
- 2 (info)
- 1 (warning)
- 0 (critical)
# HELP watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_overall_count# TYPE watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_overall_count gaugewatchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_overall_count{event_type="cp4d_watsonstudio_active_jobs_overall_count",monitor_type="cp4dplatformwatsonstudiojobinfo",reference="All Watson Studio Projects"} 2# HELP watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_project_count# TYPE watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_project_count gaugewatchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_active_jobs_project_count{event_type="cp4d_watsonstudio_active_jobs_project_count",monitor_type="cp4dplatformwatsonstudiojobinfo",reference="test-project"} 2# HELP watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_job_last_status# TYPE watchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_job_last_status gaugewatchdog_cp4dplatformwatsonstudiojobinfo_cp4d_watsonstudio_job_last_status{event_type="cp4d_watsonstudio_job_last_status",monitor_type="cp4dplatformwatsonstudiojobinfo",reference="test-project_one-time-job-1"} 2