Step-by-Step Guides
We provide a number of example Jupyter notebooks that walk through various openclean features. All notebooks and datasets are available in the GitHub repository.
- Downloading master data from Reference Data Repository
- Downloading DOB Job Application Filings from Socrata
- Misspellings in Country Names
- Statistical Outliers in City Names
- Misspellings of Brooklyn
- Profiling - DOHMH New York City Restaurant Inspection Results
- Wrangling - DOHMH New York City Restaurant Inspection Results
- Features
- Setting up
- Loading data
- Profiling
- Transformations
- kNN Clustering - DOHMH New York City Restaurant Inspection Results
- Functional Dependency Violations
- Token Signature Outliers for Street Names
- Standardization of Street Names
- User-defined Functions
- Engine - Datastore