Skip to main content
The Golden Suite is built so each tool stands alone but composes into a single pipeline. You can pick just the piece you need, or run the full chain end to end.

The pipeline flow

1

InferMap aligns schemas

Auto-maps messy source columns to a known target schema with confidence scores and human-readable reasoning.
2

GoldenCheck profiles and validates

Discovers quality rules from the data itself: encoding, format, nullability, anomalies.
3

GoldenFlow standardizes

Normalizes phone numbers, dates, addresses, and categorical spelling with 76 built-in transforms.
4

GoldenMatch deduplicates

Blocks, scores, clusters, and synthesizes golden records using fuzzy, exact, probabilistic, and LLM scoring.
5

GoldenPipe orchestrates

Runs the whole chain with adaptive logic. It skips transformation if no issues are found and explains every decision.

Polyglot by design

The same engine is implemented across languages so you can run it wherever your data lives.
SurfaceWhat it covers
PythonHeadline runtime. Full feature set, pip install, native Postgres and DuckDB support.
TypeScriptParity core. Edge-safe (Vercel Edge, Cloudflare Workers, Deno). Matches Python scorer outputs to four decimals.
RustPostgres extension (pgrx) and DuckDB UDFs for SQL-native matching.
dbtdbt-goldencheck data-quality tests as a dbt package.
GitHub Actionsgoldencheck-action gates pull requests on data-quality regressions.

AI-native surface

Every package ships an MCP server, a REST API, and an agent surface. Across the suite there are 35+ MCP tools. The AutoConfigController is visible from every interface: the web ControllerPanel, the TUI (Ctrl+A), the CLI, REST endpoints, Postgres functions, DuckDB UDFs, and MCP/agent tools. Add the remote MCP server to Claude Desktop or Claude Code:
{
  "mcpServers": {
    "goldenmatch": {
      "url": "https://goldenmatch-mcp-production.up.railway.app/mcp/"
    }
  }
}

Repository layout

The suite is a single monorepo:
packages/
├── python/       goldenmatch, goldencheck, goldenflow, goldenpipe, infermap, goldensuite-mcp
├── typescript/   goldenmatch, goldencheck, goldenflow, infermap
├── rust/         extensions (Postgres pgrx + DuckDB UDFs)
├── dbt/          dbt-goldencheck
└── actions/      goldencheck GitHub Action
examples/         Python, TypeScript, and Airflow DAG demos

Production paths

  • Postgres sync and daemon mode for continuous deduplication.
  • Review queues for human-in-the-loop correction.
  • dbt integration for warehouse-native pipelines.
  • GitHub Actions for pull-request data-quality gates.
  • Airflow DAGs (drop-in examples, TaskFlow API, Airflow 2.7+).
  • The Rust extension layer for matching directly inside Postgres or DuckDB.