Skip to content

For Data Stewards

The roche-data platform automates the infrastructure work that surrounds your expertise — so you can focus on the governance decisions that actually require human judgement.

Today, bringing a new data product to life requires you to:

  1. Manually define data contracts in documents or spreadsheets
  2. Coordinate with data engineers to create Snowflake schemas
  3. Work with analysts to build dbt models for each layer
  4. Write quality check definitions and hand them to specialists
  5. Curate governance metadata in Collibra by hand
  6. Repeat for every entity in every domain

Typical timeline: 6-8 weeks per data product, involving 4-6 specialists.

With roche-data, your work becomes:

  1. Define the model in RTiS — the entity, its attributes, its relationships
  2. Review generated artifacts — data contracts, quality rules, governance metadata
  3. Approve through pull request — peer-reviewed, version-controlled, auditable

Everything else is generated: Snowflake schemas, dbt models, quality gate predicates, DQ rules, API specifications, semantic definitions, and documentation.

Typical timeline: Minutes from model definition to deployed data product.


Automation does not replace stewardship — it amplifies it.

Your responsibilityHow the platform supports it
Data definitionsYou define entities and relationships in RTiS. The platform compiles them into every downstream artifact.
Quality rulesYou bind shared CEL rules to entity properties in rules.yaml. The platform compiles them into dbt tests and Snowflake UDFs. See Writing DQ Rules.
Ownership & SLAsGovernance metadata from Collibra is pulled automatically and embedded in data contracts.
Access policiesAccess control is managed through Snowflake RBAC and Collibra governance metadata.
Review & approvalEvery generated artifact passes through a pull request. You review and approve before deployment.

Every data product passes through four progressive quality gates. You define the rules; the platform enforces them.

GateWhat it checksYour role
G1 CompletenessSchema compliance, required fields, data typesDefine the schema in RTiS
G2 ValidityMaster data lookups, referential integritySpecify which reference data to validate against
G3 Business RulesRange checks, cross-field consistency, freshnessBind rules from shared library in rules.yaml — see QA Approach
G4 ConsistencyTrend analysis, anomaly detectionReview flagged deviations

Data that passes all four gates earns Certified status and becomes available to AI tools (Cortex Analyst) and external APIs.


For taxonomy and tag management, the platform provides a dedicated Streamlit application:

  • Review and approve proposed changes to classifications, definitions, and tags
  • Track change history with full audit trail
  • Collaborate with domain experts through a structured approval workflow

See Ratification UI for details.