Commands

Command                                                   Purpose
goldenmatch dedupe FILE [...]                             Deduplicate one or more CSV / Parquet / Excel files.
goldenmatch match TARGET --against REF                    Match a target against a reference file.
goldenmatch sync --table TABLE --connection-string "$DB"  Incremental Postgres database sync.
goldenmatch watch --table TABLE                           Live stream mode (--daemon for service mode).
goldenmatch serve FILE [...]                              Start the REST API server (default localhost:8000).
goldenmatch mcp-serve FILE [...]                          Start the MCP server (stdio or HTTP transport).
goldenmatch interactive FILE [...]                        Launch the interactive TUI.
goldenmatch serve-ui [--project DIR]                      Start the web UI workbench.
goldenmatch setup                                         Interactive setup wizard (GPU, API keys, database).
goldenmatch demo                                          Run the built-in demo with sample data.
goldenmatch init                                          Interactive config wizard.
goldenmatch profile FILE                                  Profile data quality.
goldenmatch evaluate FILE --gt GT.csv                     Evaluate matching accuracy against ground truth.
goldenmatch incremental BASE --new NEW                    Match new records against an existing base.
goldenmatch analyze-blocking FILE                         Analyze data and suggest blocking strategies.
goldenmatch label FILE --config --gt                      Interactively label pairs to build ground truth.
goldenmatch review --config CONFIG                        Guided review loop: walk borderline pairs (and the stale-correction queue) one at a time. Decisions feed Learning Memory.
goldenmatch rollback RUN_ID                               Undo a previous merge run.
goldenmatch unmerge RECORD_ID --clusters FILE             Remove a record from its cluster and re-cluster the rest; writes the updated clusters CSV. Use --shatter to break the whole cluster apart.
goldenmatch runs                                          List previous runs for rollback.
goldenmatch memory stats|learn|export|import|show         Manage the Learning Memory store.
goldenmatch config save|load|list|show                    Manage config presets.
goldenmatch compare-clusters A B                          Cluster comparison (unchanged / merged / partitioned / overlapping).
goldenmatch sensitivity FILE                              Parameter sweep over threshold, blocking, and matchkey.
goldenmatch pprl link FILE_A FILE_B                       Privacy-preserving record linkage.

Key dedupe flags

Flag                  Effect
--anomalies           Detect fake emails, placeholder data, and suspicious records.
--preview             Preview results on a sample without writing files.
--merge-preview       Show the merge preview (what will change) without writing.
--diff / --diff-html  Generate a before/after change report.
--dashboard           Before/after data-quality dashboard (HTML).
--html-report         Detailed match report with charts.
--chunked             Large-dataset mode (process in chunks).
--backend ray         Force a backend.
--llm-boost           Improve accuracy with LLM-labeled training.
--daemon              Run watch mode as a background service with a health endpoint.
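
The flags above can be combined on one dedupe invocation. The following is a minimal sketch of a preview-first workflow, not a verified recipe: the file name customers.csv, the DRY_RUN variable, and the run helper are assumptions made for illustration, and whether these particular flags can be used together in a single run has not been checked against the tool. By default the script only prints the commands, so it runs without goldenmatch installed.

```shell
#!/bin/sh
# Sketch only: INPUT, DRY_RUN, and run() are assumptions of this example,
# not part of goldenmatch.
INPUT="${INPUT:-customers.csv}"

# Print the command when DRY_RUN=1 (the default here); execute it otherwise.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Step 1: inspect results on a sample first; --preview writes no files.
run goldenmatch dedupe "$INPUT" --preview --anomalies

# Step 2: request the before/after dashboard and the detailed match report.
run goldenmatch dedupe "$INPUT" --dashboard --html-report
```

Setting DRY_RUN=0 makes the helper execute each command instead of printing it.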

Cloud storage

Read directly from cloud storage by passing a URI:
goldenmatch dedupe s3://bucket/customers.csv
goldenmatch dedupe gs://bucket/customers.csv
goldenmatch dedupe az://container/customers.csv
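
When a script mixes local files and cloud URIs, it can help to tell them apart before calling the tool. The sketch below assumes that only the three prefixes shown above (s3://, gs://, az://) mark a cloud source; goldenmatch may accept other schemes. The names is_cloud_uri and SOURCES are hypothetical, and the loop prints each command rather than running it.

```shell
#!/bin/sh
# Sketch only: is_cloud_uri and SOURCES are hypothetical names. The scheme
# list covers just the three prefixes shown above; others may exist.
is_cloud_uri() {
    case "$1" in
        s3://*|gs://*|az://*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example inputs: the three URIs from above plus a local file for contrast.
SOURCES="s3://bucket/customers.csv gs://bucket/customers.csv az://container/customers.csv customers.csv"

for src in $SOURCES; do
    if is_cloud_uri "$src"; then
        kind="cloud"
    else
        kind="local"
    fi
    # Print rather than execute, so the sketch runs without goldenmatch installed.
    echo "[$kind] goldenmatch dedupe $src"
done
```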