Deduplicate a CSV in 30 seconds, then run the full pipeline.
This page gets you from install to a deduplicated file in a couple of minutes. For a deeper tour of the headline package, see the GoldenMatch quickstart.
To profile, standardize, and deduplicate in one call, use GoldenPipe. It runs GoldenCheck, conditionally routes through GoldenFlow, then deduplicates with GoldenMatch.
pip install goldenpipe
import goldenpipe as gpresult = gp.run("customers.csv")print(result.status) # "success"print(result.reasoning) # why each stage ran or was skipped