Enterprise Data Governance Platform
暂停Internal data cataloguing and lineage platform integrating dbt, Metabase, and Supabase to give a 200-person org a single source of truth for 300+ data assets.
技术栈
Designed as a response to growing data sprawl across a scaling organisation, this platform centralises data discovery, ownership, and quality tracking. The core is a dbt project defining canonical business metrics — revenue, retention, pipeline — which feed into a Metabase BI layer accessible to non-technical stakeholders.
A custom Supabase-backed catalogue stores asset metadata (owner, description, SLA, last-tested timestamp) collected via dbt docs and a lightweight Python crawler that probes connected data sources nightly. An alerting layer monitors dbt test failures and pages the asset owner via Slack integration.
The initiative is currently on hold pending organisational alignment on data ownership policies. The technical layer is complete and battle-tested; rollout is waiting on a governance working group to finalise RACI definitions.
其他项目
Serverless appointment-slot monitor for Australian visa applications — sends real-time alerts when new slots open, built on Playwright, Vercel Cron, and Supabase.
A collection of SQL scripts, pgBadger dashboards, and Python automation tools for diagnosing and resolving slow-query issues in high-traffic PostgreSQL clusters.
Enterprise-grade bidirectional data pipeline connecting NetSuite ERP and Boomi AtomSphere, processing 50 k+ transactions per day with sub-3 s latency.