ODCS Generator#
Tools for automatically generating YAML files that comply with the Open Data Contract Standard (ODCS) v3.1.0 from data catalog metadata.
Overview#
The odcs_generator module provides automated generation of Open Data Contract Standard (ODCS) files from enterprise data catalogs. It extracts metadata from catalog systems and transforms it into standardized data contracts.
Key Features#
- Multi-Catalog Support
Generate ODCS files from Collibra and Informatica CDGC data catalogs.
- Automatic Metadata Extraction
Fetch asset details, attributes, relations, and classifications automatically.
- Column Discovery
Automatically discover and document table columns through catalog relations.
- Data Type Mapping
Map catalog data types to ODCS logical types.
- Classification Support
Extract and include data classifications and sensitivity labels.
- ODCS v3.1.0 Compliance
Generate fully compliant ODCS YAML files ready for use.
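The data type mapping feature can be pictured with a small lookup table. The sketch below is illustrative only: `PHYSICAL_TO_LOGICAL` and `to_logical_type` are hypothetical names, the table is an assumed subset, and the actual rules used by `odcs_generator` are not shown in this page.

```python
import re

# Hypothetical lookup table: catalog (physical) type name -> logical type name.
# This is an assumed subset for illustration, not the module's real mapping.
PHYSICAL_TO_LOGICAL = {
    "VARCHAR": "string",
    "CHAR": "string",
    "TEXT": "string",
    "INT": "integer",
    "INTEGER": "integer",
    "BIGINT": "integer",
    "SMALLINT": "integer",
    "DECIMAL": "number",
    "NUMERIC": "number",
    "FLOAT": "number",
    "BOOLEAN": "boolean",
    "DATE": "date",
}


def to_logical_type(physical_type: str, default: str = "string") -> str:
    """Map a catalog type such as 'VARCHAR(255)' to a logical type name."""
    # Drop length/precision arguments, then normalize whitespace and case.
    base = re.sub(r"\(.*\)", "", physical_type).strip().upper()
    # Unknown types fall back to the default instead of raising.
    return PHYSICAL_TO_LOGICAL.get(base, default)
```

Falling back to a default keeps generation going when a catalog reports an unfamiliar type; a stricter variant could raise instead so that unmapped types are noticed early.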
Quick Start#
Collibra Integration
from wxdi.odcs_generator.generate_odcs_from_collibra import CollibraClient, ODCSGenerator

# Initialize client
client = CollibraClient(
    base_url="https://your-instance.collibra.com",
    username="your_username",
    password="your_password"
)

# Create generator
generator = ODCSGenerator(client)

# Generate ODCS
odcs_data = generator.generate_odcs("asset-id")

# Save to file
generator.save_to_yaml(odcs_data, "output.yaml")
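When several assets need contracts, the two calls above can be wrapped in a loop. The following is a minimal sketch, assuming only the `generate_odcs` and `save_to_yaml` methods shown in the Quick Start; `generate_contracts` and the `contracts` output directory are hypothetical names, and asset IDs are assumed to be safe to use as file names.

```python
from pathlib import Path


def generate_contracts(generator, asset_ids, output_dir="contracts"):
    """Write one YAML file per asset and return (written_paths, failures)."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)

    written = []
    failed = {}
    for asset_id in asset_ids:
        # Assumption: the asset ID is usable as a file name.
        target = out / f"{asset_id}.yaml"
        try:
            odcs_data = generator.generate_odcs(asset_id)
            generator.save_to_yaml(odcs_data, str(target))
        except Exception as exc:
            # Broad on purpose: the client's exception types are not
            # documented here, and one bad asset should not stop the batch.
            failed[asset_id] = str(exc)
        else:
            written.append(target)
    return written, failed
```

Calling `generate_contracts(generator, ["asset-id-1", "asset-id-2"])` would then return the files that were written together with a mapping of failed asset IDs to error messages.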
Informatica Integration
from wxdi.odcs_generator.generate_odcs_from_informatica import InformaticaClient, ODCSGenerator

# Initialize client
client = InformaticaClient(
    base_url="https://your-informatica-instance.com",
    username="your_username",
    password="your_password"
)

# Create generator
generator = ODCSGenerator(client)

# Generate ODCS
odcs_data = generator.generate_odcs("asset-id")
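Both examples pass credentials as string literals. In practice it is usually safer to read them from the environment. The sketch below assumes nothing about the library beyond the `base_url`, `username`, and `password` keyword arguments shown above; `catalog_settings` and the `PREFIX_*` variable naming are hypothetical.

```python
import os


def catalog_settings(prefix, environ=None):
    """Read base_url, username and password from PREFIX_* environment variables."""
    env = os.environ if environ is None else environ
    settings = {}
    missing = []
    for field in ("base_url", "username", "password"):
        name = f"{prefix}_{field.upper()}"  # e.g. INFORMATICA_BASE_URL
        value = env.get(name)
        if value:
            settings[field] = value
        else:
            missing.append(name)
    if missing:
        # Fail early with every missing name, not just the first one.
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return settings
```

With `INFORMATICA_BASE_URL`, `INFORMATICA_USERNAME`, and `INFORMATICA_PASSWORD` set, the client could be created as `InformaticaClient(**catalog_settings("INFORMATICA"))`; the same helper with the prefix `COLLIBRA` would serve `CollibraClient`.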
Use Cases#
- Data Contract Automation
Automatically generate data contracts from existing catalog metadata.
- Catalog Migration
Export catalog metadata to standardized ODCS format for migration.
- Documentation Generation
Create comprehensive data documentation from catalog assets.
- Compliance Reporting
Generate standardized contracts for compliance and governance.
- Data Product Onboarding
Accelerate data product creation with automated contract generation.
Supported Catalogs#
- Collibra
  - Asset metadata extraction
  - Column discovery via relations
  - Data classifications via GraphQL
  - Tag integration
  - Custom attributes
- Informatica CDGC
  - Asset metadata extraction
  - Column schema discovery
  - System attributes
  - Technical metadata
  - Business glossary terms
What is ODCS?#
The Open Data Contract Standard (ODCS) is an open-source specification for defining data contracts. It provides:
- Standardized Format: Common structure for data contracts across organizations
- Schema Definition: Detailed column-level metadata and constraints
- Quality Rules: Data quality expectations and validation rules
- Service Level Agreements: Performance and availability commitments
- Governance: Data ownership, stewardship, and compliance information
ODCS v3.1.0 Structure#
id: unique-contract-id
kind: DataContract
apiVersion: v3.1.0
domain: domain-name
dataProduct: product-name
version: 1.0.0
name: contract-name
status: active

description:
  authoritativeDefinitions:
    - type: source-system
      url: source-url

schema:
  - id: table-id
    name: table-name
    columns:
      - id: column-id
        name: column-name
        logicalType: string
        physicalType: VARCHAR(255)
        description: column description
        classification: PII

quality:
  - id: rule-id
    name: rule-name
    type: completeness
    column: column-name
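A generated contract can be given a quick sanity check before it is saved. The sketch below works on the contract as a plain Python dictionary and only verifies that a handful of top-level keys are present. The list in `REQUIRED_TOP_LEVEL` is an assumption about which fields ODCS treats as mandatory and should be confirmed against the official schema; `missing_top_level_fields` is a hypothetical helper, not part of `odcs_generator`.

```python
# Assumed set of mandatory top-level keys; confirm against the official schema.
REQUIRED_TOP_LEVEL = ("apiVersion", "kind", "id", "version", "status")


def missing_top_level_fields(contract: dict) -> list:
    """Return the required top-level keys that are absent or empty."""
    return [key for key in REQUIRED_TOP_LEVEL if not contract.get(key)]


# Example contract mirroring the placeholder values in the structure above.
contract = {
    "id": "unique-contract-id",
    "kind": "DataContract",
    "apiVersion": "v3.1.0",
    "version": "1.0.0",
    "name": "contract-name",
    "status": "active",
    "schema": [
        {
            "id": "table-id",
            "name": "table-name",
            "columns": [
                {"id": "column-id", "name": "column-name", "logicalType": "string"},
            ],
        },
    ],
}

print(missing_top_level_fields(contract))  # prints []
```

This is a presence check only. Full validation against the ODCS JSON schema would catch type and structure errors that this sketch ignores.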
Next Steps#
- Collibra Integration - Collibra integration guide
- Informatica Integration - Informatica integration guide
- Examples - Complete code examples
- ODCS Generator Reference - API reference