Data Governance · Operating in Public

The framework,
running.

Most portfolios say "I designed data governance frameworks." This page is the framework. Every number on djtakerun.jp comes from BigQuery. Every master record has a Sheets-shaped editor. Every transformation is in Git. Every sync runs nightly. Click around.

① Architecture — read this first.

SOURCES Google Sheets (master) Mixcloud Public API YouTube Data API v3 Strava OAuth INGESTION (CLOUD FUNCTIONS) sync_sheets ✓ sync_mixcloud ✓ sync_youtube sync_strava LAKE (IMMUTABLE) gs://takerun-lake /raw/{source}/YYYY-MM-DD/ WAREHOUSE (BIGQUERY) raw.* staging.* dim.* (master) ref.* (reference) fact.* mart.* (views) SERVING export_json ✓ → /data/*.json djtakerun.jp fetch /data/kpis.json

② Catalog — every table, every column.

Tier Dataset Table Type Created
loading from /data/governance_catalog.json…

③ Sync Status — last touched.

Source Rows Last sync Platforms
loading…

④ Lineage — where data comes from.

Source

Google Sheets

TakeRun edits master data with Editor permission shared to takerun-pipeline SA.

  • dim_identity
  • dim_platform
  • dim_city
  • dim_track_genre
  • dim_inspiration_channel
Lake

Cloud Storage

Immutable raw JSON, versioning ON. Audit trail for any future re-build.

  • raw/sheets/YYYY-MM-DD/
  • raw/mixcloud/YYYY-MM-DD/
  • raw/youtube/YYYY-MM-DD/
  • raw/strava/YYYY-MM-DD/
Warehouse

BigQuery

raw → dim/fact via MERGE on PK. Schema in Git (data/sql/), descriptions UTF-8 safe.

  • 6 datasets
  • 5 dim + 4 ref + 1 fact + 4 mart
Serve

Static JSON / CDN

export_json writes mart_* to gs://takerun-hub/data/*.json. Public, cached 5 min.

  • kpis.json
  • cities.json
  • governance_*.json

⑤ Decisions — why this, not that.

⑥ Codex — DAMA-DMBOK v2 mapping

The TakeRun OS Data Governance Codex v1.0 — 11 policies aligned with DMBOK's 11 knowledge areas, written in Japanese enterprise-doc style (regulation number / effective date / revision history / owner / scope) so it can be presented as-is in interviews.