Skip to content

Collibra

Collibra is Roche’s enterprise data governance platform. It is bidirectional in the roche-data pipeline: the govern module pulls stewardship metadata (ownership, SLAs, classification, PII flags) at generation time, and the register module pushes lineage information back after deployment.

Access to Collibra is proxied through Mulesoft — there is no direct API access.

PropertyValue
URLhttps://roche.collibra.com (via Mulesoft proxy)
Auth methodClient credentials (Mulesoft proxy headers)
NetworkVia Mulesoft API proxy (no direct Collibra access)
Access task (read)A07
Access task (write)A08
GitHub issue (read)#25 (closed — resolved)
GitHub issue (write)#76 (open — blocked)
VariableSourceDescription
COLLIBRA_BASE_URLVault common/collibraBase URL for Collibra API (via Mulesoft proxy)
MULESOFT_MODEL_CLIENT_IDVault {env}/collibraOAuth client ID for Mulesoft proxy authentication
MULESOFT_MODEL_CLIENT_SECRETVault {env}/collibraOAuth client secret for Mulesoft proxy authentication
X-META-BRIDGE-KEYVault {env}/collibraAdditional header key for Mulesoft bridge
ModuleDirectionUsage
rdt-model-governPull (read)Fetches governance metadata — ownership, SLAs, classification, PII flags — writes governance.json
rdt-model-registerPush (write)Pushes lineage records and quality gate results after deployment

Collibra data flow — bidirectional governance metadata through Mulesoft proxy

Script: scripts/access/check-collibra.sh

Required tools: curl, jq

Required env vars: MULESOFT_MODEL_CLIENT_ID, MULESOFT_MODEL_CLIENT_SECRET

Checks performed:

  1. Mulesoft proxy reachability (GET /rest/2.0/communities)
  2. Client credential validation
  3. Community data retrieval (community count)
  4. Asset search (GET /rest/2.0/assets) for governance metadata read
  5. Write access status (currently reports blocked — A08 pending)

All Collibra API calls go through the Mulesoft proxy with client credentials:

Terminal window
curl -s "https://roche.collibra.com/rest/2.0/communities" \
-H "client_id: $MULESOFT_MODEL_CLIENT_ID" \
-H "client_secret: $MULESOFT_MODEL_CLIENT_SECRET" \
-H "X-META-BRIDGE-KEY: $X_META_BRIDGE_KEY"
MethodPathModulePurpose
GET/rest/2.0/communitiesgovernList available governance communities
GET/rest/2.0/assetsgovernSearch for entity assets by name/type
GET/rest/2.0/assets/{id}/attributesgovernGet governance attributes (SLA, PII, etc.)
POST/rest/2.0/assetsregisterCreate lineage assets (blocked — A08)
POST/rest/2.0/relationsregisterCreate lineage relationships (blocked — A08)

The govern module pulls and writes to models/{entity}/governance.json:

FieldCollibra sourceUsed by
Data stewardAsset responsibilityContract, docs
Data ownerAsset responsibilityContract, docs
SLACustom attributeContract, policy
ClassificationCustom attributePolicy (access domain)
PII flagsCustom attributePolicy (access domain), Silver enrichment
Quality scoreCustom attributeGold view predicates
  • Read path: Fully implemented (HttpCollibraClient via Mulesoft proxy, PR #114)
  • Write path: Blocked pending write-access service account (A08)
  • Workaround: StubCollibraClient returns fixture governance data for all entities