csvsmith documentation

csvsmith is a toolkit for cleaning, transforming, and managing CSV-oriented workflows.

It provides both a command-line interface (CLI) and a Python API for building reproducible data processing pipelines.

Example

csvsmith excel-to-csv input.xlsx -o output.csv
csvsmith dedupe data.csv -o clean.csv
csvsmith clean-currency-numeric '$1,234.56'

Getting started

Tools

Python API

Development & History