watsonx.data
Solution report card
| Runs on IBM i? | ❌ | |
| On-prem | ✅ | |
| IBM Cloud | ✅ | |
| AI capabilities | Apache Spark Agentic AI many more… | |
| Commercial support | ✅ | |
| Free to try? | ✅ | |
| Requirements |
What is watsonx.data?
Section titled “What is watsonx.data?”watsonx.data is IBM’s data lakehouse platform — designed to consolidate data from multiple sources into a single, governed, queryable layer. It combines the flexibility of a data lake with the structure of a data warehouse, using open formats (Apache Iceberg, Parquet) and open query engines (Presto/Trino, Apache Spark) to avoid vendor lock-in.
For IBM i environments, watsonx.data serves as a data federation and analytics layer that can query Db2 for i data alongside data from other systems — cloud object storage, relational databases, streaming platforms — using a single SQL interface.
Key capabilities
Section titled “Key capabilities”Federated query across data sources
Section titled “Federated query across data sources”watsonx.data’s Presto/Trino engine can query Db2 for i directly via the IBM Db2 for i connector, without requiring data migration. This enables cross-system analytics queries that join IBM i data with data from cloud storage, other databases, or Kafka topics.
Apache Spark for AI and data engineering
Section titled “Apache Spark for AI and data engineering”A managed Spark environment within watsonx.data enables large-scale data processing, feature engineering for ML pipelines, and ETL workflows — all operating over data that includes IBM i Db2 tables.
Open table formats
Section titled “Open table formats”Data registered in watsonx.data using Apache Iceberg is accessible to any tool that understands the format — including watsonx.ai for model training, BI tools like Cognos, and external data science environments.
Data governance
Section titled “Data governance”watsonx.data integrates with IBM’s governance capabilities (via watsonx.governance) to enforce data access policies, track data lineage, and maintain a business glossary — important for regulated industries where IBM i is common.
Connecting watsonx.data to IBM i
Section titled “Connecting watsonx.data to IBM i”See Accessing Db2 from watsonx.data for step-by-step instructions, and IBM Cloud Satellite Connector for connecting an on-premises IBM i to watsonx.data running in IBM Cloud.