CLI reference
Commands, flags, and the config format.
Performance
Vectorized fast paths and the optional Arrow-native phone kernel.
Install
goldenflow[check] adds GoldenCheck integration, goldenflow[mcp] adds the MCP server, and goldenflow[all] brings everything.
Quickstart
Zero-config auto-detect:Key features
- Zero-config mode: auto-detects column types and applies safe transforms.
- 76 transforms across 11 categories.
- 5 domain packs: healthcare, finance, ecommerce, people/HR, real estate.
- Audit trail: a JSON manifest of every transform, which rows changed, and before/after samples.
- Schema mapping: auto-map columns between systems by name similarity and data profiling.
- Streaming: process large files in chunks via
StreamProcessor. - Cloud connectors: transparent
s3://andgs://paths. - Watch and schedule: auto-transform new files, or run on an interval.
- LLM-enhanced mode: optional categorical corrections via
--llm. - Polars-native, with Jupyter-friendly rich rendering. Date and phone normalization use vectorized fast paths (≈14× faster end-to-end), plus an optional Arrow-native Rust kernel for the phone tail.