openclean
latest
Contents:
Installation
Users
Contributors
Getting Started
Loading Data
Profiling the Dataset
Selecting Columns
Downloading and Preparing Master data
Identifying Fixes
Making Repairs
More Examples
Data Model
Datasets and Streams
Eval Functions
Col
Cols
Const
And
Or
Data Profiling
Using the openclean profiler
Visualizing profiled results
Data Transformation
Selecting
Inserting
Updating
Filtering
Moving
Sorting
Data Wrangling and Cleaning
Functional Dependency Violations
Missing Values
Misspellings and Data Entry Bugs
Data Standardization
Statistical Outliers
Custom functions
Data Enrichment
Master data using Socrata
Master data using Reference Data Repository
Data Provenance
Initialize
Create
Commit
Checkout
Rollback
Register
Other Examples
Step by Step Guides
Downloading master data from Reference Data Repository
restcountries.eu
Encyclopaedia Britannica
Cleanup
Downloading DOB Job Application Filings from Socrata
Misspellings in Country Names
Download Country Names Masterdata
Identify Country Name Outliers in ITU ICT Development Index (IDI)
Repair Country Name Outliers in ITU ICT Development Index (IDI)
Statistical Outliers in City names
Misspellings of Brooklyn
Profiling - DOHMH New York City Restaurant Inspection Results
Data Profiling
Wrangling - DOHMH New York City Restaurant Inspection Results
Data Cleaning
Extract Relevant Records
Features
Data Profiling
Data Cleaning & Wrangling
Data Enrichment
Data Provenance
Setting up
Loading data
Profiling
Transformations
Date Conversion
Standardizing Spellings
kNN Clustering - DOHMH New York City Restaurant Inspection Results
Extract Relevant Records
Functional Dependency Violations
Token Signature Outliers for Street Names
Standardization of Street Names
User-defined Functions
Engine - Datastore
Notebook Spreadsheet UI
Rollback Changes in Persistent Archive
Extensions
openclean-notebook
openclean-pattern
Configuration
Data Storage
Multi-Threading
Configuration for Workers for External Processes
Contributing
Code
Test Coverage
Bug report or feature request
Documentation
Frequently Asked Questions
Where to report bugs?
API Reference:
openclean
openclean package
Subpackages
Submodules
openclean
»
Index
Edit on GitHub
Index
A
|
B
|
C
|
D
|
E
|
F
|
G
|
H
|
I
|
J
|
K
|
L
|
M
|
N
|
O
|
P
|
R
|
S
|
T
|
U
|
V
|
W
A
action (openclean.engine.log.LogEntry attribute)
ActionHandle (class in openclean.data.archive.base)
Add (class in openclean.function.eval.base)
add() (openclean.cluster.base.Cluster method)
(openclean.cluster.index.ClusterIndex method)
(openclean.cluster.index.Node method)
(openclean.data.groupby.ConflictSummary method)
(openclean.data.groupby.DataFrameGrouping method)
(openclean.data.groupby.DataFrameViolation method)
(openclean.data.mapping.Mapping method)
(openclean.embedding.base.FeatureVector method)
(openclean.engine.log.OperationLog method)
(openclean.function.matching.fuzzy.FuzzySimilarity method)
(openclean.profiling.anomalies.frequency.FrequencyOutlierResults method)
(openclean.profiling.dataset.DatasetProfile method)
Aggregate (class in openclean.operator.collector.aggregate)
aggregate() (in module openclean.operator.collector.aggregate)
All (class in openclean.function.base)
AlphaNumeric (class in openclean.function.value.text)
always_false() (in module openclean.util.core)
And (class in openclean.function.eval.logic)
AnomalyDetector (class in openclean.profiling.anomalies.base)
append() (openclean.pipeline.DataPipeline method)
Apply (class in openclean.operator.transform.apply)
apply() (in module openclean.operator.transform.apply)
(openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.engine.base.OpencleanEngine method)
(openclean.engine.dataset.DataSample method)
(openclean.engine.dataset.FullDataset method)
(openclean.function.value.base.ValueFunction method)
(openclean.function.value.filter.Filter method)
ArchiveStore (class in openclean.data.archive.base)
Avg (class in openclean.function.eval.aggregate)
B
best_matches() (in module openclean.function.matching.base)
BestMatch (class in openclean.function.value.domain)
BestMatches (class in openclean.operator.stream.matching)
BinaryOperator (class in openclean.function.eval.base)
BinaryStreamFunction (class in openclean.function.eval.base)
Bool (class in openclean.function.eval.datatype)
BOTH (openclean.profiling.classifier.base.ResultFeatures attribute)
C
CachedDatastore (class in openclean.data.archive.cache)
CacheEntry (class in openclean.data.archive.cache)
CallableWrapper (class in openclean.function.value.base)
Capitalize (class in openclean.function.eval.text)
CapitalizeTokens (class in openclean.function.token.base)
cast() (in module openclean.function.value.datatype)
(openclean.profiling.datatype.convert.DatatypeConverter method)
catalog() (openclean.data.source.socrata.Socrata method)
ChartypeSplit (class in openclean.function.token.split)
checkout() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.engine.base.OpencleanEngine method)
(openclean.engine.dataset.DatasetHandle method)
Classifier (class in openclean.profiling.classifier.base)
ClassLabel (class in openclean.function.value.classifier)
close() (openclean.cluster.base.StreamClusterer method)
(openclean.operator.stream.collector.Collector method)
(openclean.operator.stream.collector.DataFrame method)
(openclean.operator.stream.collector.Distinct method)
(openclean.operator.stream.collector.RowCount method)
(openclean.operator.stream.collector.Write method)
(openclean.operator.stream.consumer.ProducingConsumer method)
(openclean.operator.stream.consumer.StreamConsumer method)
(openclean.operator.stream.matching.BestMatches method)
(openclean.operator.stream.sample.SampleCollector method)
(openclean.pipeline.DataPipeline method)
(openclean.pipeline.PipelineIterator method)
(openclean.profiling.base.DataProfiler method)
(openclean.profiling.base.DistinctSetProfiler method)
(openclean.profiling.classifier.base.Classifier method)
(openclean.profiling.column.DefaultStreamProfiler method)
(openclean.profiling.dataset.ProfileConsumer method)
(openclean.profiling.tests.ValueCounter method)
Cluster (class in openclean.cluster.base)
cluster() (openclean.pipeline.DataPipeline method)
Clusterer (class in openclean.cluster.base)
ClusterIndex (class in openclean.cluster.index)
clusters() (openclean.cluster.base.Clusterer method)
(openclean.cluster.key.KeyCollision method)
(openclean.cluster.knn.kNNClusterer method)
Col (class in openclean.function.eval.base)
Collector (class in openclean.operator.stream.collector)
Cols (class in openclean.function.eval.base)
column() (openclean.profiling.dataset.DatasetProfile method)
ColumnAggregator (class in openclean.function.eval.aggregate)
ColumnProfile (class in openclean.profiling.column)
columns (openclean.data.groupby.DataFrameGrouping property)
commit() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.engine.base.OpencleanEngine method)
(openclean.engine.dataset.DatasetHandle method)
CommitOp (class in openclean.engine.action)
compile() (openclean.profiling.pattern.base.Pattern method)
compute() (openclean.function.matching.fuzzy.FuzzySimilarity method)
(openclean.function.value.normalize.numeric.DivideByTotal method)
(openclean.function.value.normalize.numeric.MaxAbsScale method)
(openclean.function.value.normalize.numeric.MinMaxScale method)
(openclean.function.value.normalize.numeric.NumericNormalizer method)
Concat (class in openclean.function.eval.text)
ConditionalOutliers (class in openclean.profiling.anomalies.conditional)
ConditionalStatement (class in openclean.function.value.cond)
conflict_repair() (in module openclean.operator.collector.repair)
ConflictRepair (class in openclean.operator.collector.repair)
conflicts() (openclean.data.groupby.DataFrameViolation method)
ConflictSummary (class in openclean.data.groupby)
Const (class in openclean.function.eval.base)
ConstantValue (class in openclean.function.value.base)
consume() (openclean.cluster.base.StreamClusterer method)
(openclean.operator.stream.collector.Collector method)
(openclean.operator.stream.collector.DataFrame method)
(openclean.operator.stream.collector.Distinct method)
(openclean.operator.stream.collector.RowCount method)
(openclean.operator.stream.collector.Write method)
(openclean.operator.stream.consumer.ProducingConsumer method)
(openclean.operator.stream.consumer.StreamConsumer method)
(openclean.operator.stream.matching.BestMatches method)
(openclean.operator.stream.sample.SampleCollector method)
(openclean.profiling.base.DataProfiler method)
(openclean.profiling.base.DistinctSetProfiler method)
(openclean.profiling.classifier.base.Classifier method)
(openclean.profiling.column.ColumnProfile method)
(openclean.profiling.column.DefaultStreamProfiler method)
(openclean.profiling.dataset.ProfileConsumer method)
(openclean.profiling.stats.MinMaxCollector method)
(openclean.profiling.tests.ValueCounter method)
contains() (openclean.function.token.convert.TokenConverter method)
(openclean.function.token.convert.TokenMapper method)
convert() (openclean.function.token.convert.TokenConverter method)
(openclean.function.token.convert.TokenMapper method)
(openclean.profiling.datatype.convert.DatatypeConverter method)
Count (class in openclean.function.eval.aggregate)
count (openclean.data.groupby.ValueConflicts attribute)
count() (in module openclean.operator.collector.count)
(openclean.pipeline.DataPipeline method)
CounterConverter (class in openclean.function.value.base)
counts() (openclean.profiling.anomalies.frequency.FrequencyOutlierResults method)
create() (in module openclean.data.archive.base)
(openclean.engine.base.OpencleanEngine method)
D
DamerauLevenshteinDistance (class in openclean.function.similarity.text)
data (openclean.embedding.base.FeatureVector property)
data_id (openclean.engine.store.default.StoredObject attribute)
DATADIR() (in module openclean.config)
DataFrame (class in openclean.operator.stream.collector)
DataFrameGrouping (class in openclean.data.groupby)
DataFrameMapper (class in openclean.operator.base)
DataFrameSplitter (class in openclean.operator.base)
DataFrameTransformer (class in openclean.operator.base)
DataFrameViolation (class in openclean.data.groupby)
DataGroupReducer (class in openclean.operator.base)
DataGroupTransformer (class in openclean.operator.base)
DataPipeline (class in openclean.pipeline)
DataProfiler (class in openclean.profiling.base)
DataSample (class in openclean.engine.dataset)
dataset() (in module openclean.data.load)
(openclean.data.source.socrata.Socrata method)
(openclean.engine.base.OpencleanEngine method)
dataset_profile() (in module openclean.profiling.dataset)
DatasetHandle (class in openclean.engine.dataset)
DatasetProfile (class in openclean.profiling.dataset)
DataStore (class in openclean.data.store.base)
DataStreamProfiler (class in openclean.profiling.base)
datatype_outliers() (in module openclean.profiling.anomalies.datatype)
DatatypeConverter (class in openclean.profiling.datatype.convert)
DatatypeOutlierResults (class in openclean.profiling.anomalies.datatype)
DatatypeOutliers (class in openclean.profiling.anomalies.datatype)
Datatypes (class in openclean.profiling.classifier.datatype)
datatypes() (in module openclean.profiling.classifier.datatype)
Datetime (class in openclean.function.eval.datatype)
(class in openclean.function.value.datatype)
DB() (in module openclean.engine.base)
dbscan() (in module openclean.profiling.anomalies.sklearn)
DBSCANOutliers (class in openclean.profiling.anomalies.sklearn)
decorate() (openclean.function.eval.base.Eval method)
default_preproc() (in module openclean.function.value.normalize.text)
default_store() (in module openclean.engine.library)
DefaultColumnProfiler (class in openclean.profiling.column)
DefaultConverter() (in module openclean.profiling.datatype.convert)
DefaultDatatypeClassifier() (in module openclean.function.value.datatype)
DefaultObjectStore (class in openclean.engine.store.default)
DefaultStreamProfiler (class in openclean.profiling.column)
DefaultStringMatcher (class in openclean.function.matching.base)
DefaultTokenizer() (in module openclean.profiling.anomalies.pattern)
delete() (in module openclean.data.archive.base)
(in module openclean.operator.transform.filter)
(openclean.pipeline.DataPipeline method)
delete_annotation() (openclean.data.metadata.base.MetadataStore method)
delete_object() (openclean.data.store.base.DataStore method)
(openclean.data.store.fs.FileSystemJsonStore method)
(openclean.data.store.mem.VolatileDataStore method)
(openclean.engine.store.base.ObjectStore method)
(openclean.engine.store.default.DefaultObjectStore method)
dependant (openclean.profiling.constraints.fd.FunctionalDependency property)
description (openclean.engine.object.base.ObjectHandle attribute)
descriptor (openclean.engine.log.LogEntry attribute)
(openclean.engine.store.default.StoredObject attribute)
deserialize() (openclean.engine.object.base.ObjectFactory method)
(openclean.engine.object.function.FunctionFactory method)
(openclean.engine.object.mapping.MappingFactory method)
(openclean.engine.object.vocabulary.VocabularyFactory method)
determinant (openclean.profiling.constraints.fd.FunctionalDependency property)
df (openclean.data.archive.cache.CacheEntry attribute)
digits_count() (in module openclean.embedding.feature.character)
digits_fraction() (in module openclean.embedding.feature.character)
Distinct (class in openclean.operator.stream.collector)
DISTINCT (openclean.profiling.classifier.base.ResultFeatures attribute)
distinct() (in module openclean.operator.collector.count)
(openclean.pipeline.DataPipeline method)
(openclean.profiling.column.ColumnProfile method)
distinct_values() (openclean.pipeline.DataPipeline method)
DistinctSetProfiler (class in openclean.profiling.base)
DistinctValueProfiler (class in openclean.profiling.column)
Divide (class in openclean.function.eval.base)
divide_by_total() (in module openclean.function.value.normalize.numeric)
DivideByTotal (class in openclean.function.eval.normalize)
(class in openclean.function.value.normalize.numeric)
domain_outliers() (in module openclean.profiling.anomalies.domain)
DomainOutliers (class in openclean.profiling.anomalies.domain)
domains() (openclean.data.source.socrata.Socrata method)
download() (in module openclean.data.refdata)
drop() (openclean.engine.base.OpencleanEngine method)
(openclean.engine.dataset.DataSample method)
(openclean.engine.dataset.DatasetHandle method)
(openclean.engine.dataset.FullDataset method)
DummyMatcher (class in openclean.function.matching.tests)
E
embed() (openclean.embedding.base.ValueEmbedder method)
(openclean.embedding.feature.base.FeatureEmbedding method)
Embedding (class in openclean.embedding.base)
embedding() (in module openclean.embedding.base)
encode() (openclean.function.token.base.Tokenizer method)
EndsWith (class in openclean.function.eval.text)
entropy() (in module openclean.profiling.stats)
Eq (class in openclean.function.eval.base)
Eval (class in openclean.function.eval.base)
eval() (openclean.embedding.feature.frequency.NormalizedFrequency method)
(openclean.embedding.feature.length.NormalizedLength method)
(openclean.engine.library.ObjectLibrary method)
(openclean.function.eval.aggregate.ColumnAggregator method)
(openclean.function.eval.base.BinaryOperator method)
(openclean.function.eval.base.Col method)
(openclean.function.eval.base.Cols method)
(openclean.function.eval.base.Const method)
(openclean.function.eval.base.Eval method)
(openclean.function.eval.base.EvalFunction method)
(openclean.function.eval.domain.Lookup method)
(openclean.function.eval.random.Rand method)
(openclean.function.token.base.Tokens method)
(openclean.function.value.base.CallableWrapper method)
(openclean.function.value.base.ConstantValue method)
(openclean.function.value.base.CounterConverter method)
(openclean.function.value.base.UnpreparedFunction method)
(openclean.function.value.base.ValueFunction method)
(openclean.function.value.classifier.ClassLabel method)
(openclean.function.value.classifier.ValueClassifier method)
(openclean.function.value.cond.ConditionalStatement method)
(openclean.function.value.domain.BestMatch method)
(openclean.function.value.domain.IsInDomain method)
(openclean.function.value.key.fingerprint.Fingerprint method)
(openclean.function.value.mapping.Lookup method)
(openclean.function.value.mapping.Standardize method)
(openclean.function.value.normalize.numeric.NumericNormalizer method)
(openclean.function.value.normalize.text.TextNormalizer method)
(openclean.function.value.phonetic.PhoneticMatcher method)
(openclean.function.value.picker.ValuePicker method)
(openclean.function.value.regex.IsMatch method)
(openclean.function.value.text.AlphaNumeric method)
(openclean.function.value.threshold.ThresholdPredicate method)
(openclean.operator.collector.repair.ValueExtractor method)
eval_all (class in openclean.util.core)
EvalFunction (class in openclean.function.eval.base)
evaluate() (in module openclean.function.eval.base)
ExactMatch (class in openclean.data.mapping)
ExactSimilarity (class in openclean.function.matching.base)
exec() (openclean.embedding.base.Embedding method)
(openclean.profiling.pattern.base.PatternFinder method)
extract() (in module openclean.function.value.base)
F
fd_violations() (in module openclean.operator.map.violations)
FeatureEmbedding (class in openclean.embedding.feature.base)
FeatureVector (class in openclean.embedding.base)
FILE() (in module openclean.data.metadata.fs)
FileSystemJsonStore (class in openclean.data.store.fs)
FileSystemMetadataStore (class in openclean.data.metadata.fs)
FileSystemMetadataStoreFactory (class in openclean.data.metadata.fs)
Filter (class in openclean.function.value.filter)
(class in openclean.operator.transform.filter)
filter() (in module openclean.function.value.filter)
(in module openclean.operator.transform.filter)
(openclean.data.mapping.Mapping method)
(openclean.pipeline.DataPipeline method)
find() (openclean.profiling.anomalies.base.AnomalyDetector method)
(openclean.profiling.pattern.base.PatternFinder method)
find_matches() (openclean.function.matching.base.DefaultStringMatcher method)
(openclean.function.matching.base.StringMatcher method)
Fingerprint (class in openclean.function.value.key.fingerprint)
FirstLastFilter (class in openclean.function.token.filter)
Float (class in openclean.function.eval.datatype)
(class in openclean.function.value.datatype)
FloorDivide (class in openclean.function.eval.base)
Format (class in openclean.function.eval.text)
fraction() (in module openclean.embedding.feature.character)
frequencies() (openclean.profiling.anomalies.frequency.FrequencyOutlierResults method)
frequency_outliers() (in module openclean.profiling.anomalies.frequency)
FrequencyOutlierResults (class in openclean.profiling.anomalies.frequency)
FrequencyOutliers (class in openclean.profiling.anomalies.frequency)
FullDataset (class in openclean.engine.dataset)
FunctionalDependency (class in openclean.profiling.constraints.fd)
FunctionalDependencyFinder (class in openclean.profiling.constraints.fd)
FunctionFactory (class in openclean.engine.object.function)
FunctionHandle (class in openclean.engine.object.function)
FunctionRepository (class in openclean.engine.store.function)
functions() (openclean.engine.library.ObjectLibrary method)
FuzzySimilarity (class in openclean.function.matching.fuzzy)
G
ge() (in module openclean.util.threshold)
Geq (class in openclean.function.eval.base)
Get (class in openclean.function.eval.list)
get() (in module openclean.data.archive.base)
(openclean.data.groupby.DataFrameGrouping method)
(openclean.engine.store.base.ObjectStore method)
get_agg_funcs() (in module openclean.operator.collector.aggregate)
get_annotation() (openclean.data.metadata.base.MetadataStore method)
get_eval_func() (in module openclean.operator.map.groupby)
get_meta() (openclean.data.groupby.DataFrameViolation method)
get_object() (openclean.engine.store.base.ObjectStore method)
(openclean.engine.store.default.DefaultObjectStore method)
get_store() (openclean.data.metadata.base.MetadataStoreFactory method)
(openclean.data.metadata.fs.FileSystemMetadataStoreFactory method)
(openclean.data.metadata.mem.VolatileMetadataStoreFactory method)
get_type() (openclean.function.token.split.ChartypeSplit method)
get_update_function() (in module openclean.operator.transform.update)
get_value() (in module openclean.data.util)
gram_counter() (in module openclean.function.matching.fuzzy)
gram_iterator() (in module openclean.function.matching.fuzzy)
GreaterOrEqual (class in openclean.function.value.threshold)
GreaterThan (class in openclean.function.value.threshold)
Greatest (class in openclean.function.eval.row)
GroupBy (class in openclean.operator.map.groupby)
groupby() (in module openclean.operator.map.groupby)
groups() (openclean.data.groupby.DataFrameGrouping method)
Gt (class in openclean.function.eval.base)
gt() (in module openclean.util.threshold)
H
HammingDistance (class in openclean.function.similarity.text)
handle() (openclean.engine.operator.StreamOperator method)
(openclean.operator.stream.consumer.ProducingConsumer method)
(openclean.operator.stream.consumer.StreamFunctionHandler method)
(openclean.operator.stream.sample.SampleCollector method)
(openclean.operator.transform.limit.LimitConsumer method)
(openclean.profiling.datatype.operator.Typecast method)
has_annotation() (openclean.data.metadata.base.MetadataStore method)
has_two_spec_chars() (in module openclean.function.value.datatype)
head() (openclean.pipeline.DataPipeline method)
HISTOREDatastore (class in openclean.data.archive.histore)
I
InsCol (class in openclean.operator.transform.insert)
inscol() (in module openclean.operator.transform.insert)
insert() (openclean.engine.dataset.DatasetHandle method)
(openclean.pipeline.DataPipeline method)
insert_object() (openclean.engine.store.base.ObjectStore method)
(openclean.engine.store.default.DefaultObjectStore method)
InsertOp (class in openclean.engine.action)
inspos() (openclean.operator.transform.insert.InsCol method)
InsRow (class in openclean.operator.transform.insert)
insrow() (in module openclean.operator.transform.insert)
Int (class in openclean.function.eval.datatype)
(class in openclean.function.value.datatype)
is_datetime() (in module openclean.function.value.datatype)
is_default (openclean.engine.store.default.StoredObject property)
is_empty() (in module openclean.function.value.null)
is_float() (in module openclean.function.value.datatype)
is_frame_mapper() (openclean.operator.base.PipelineStage method)
is_frame_splitter() (openclean.operator.base.PipelineStage method)
is_frame_transformer() (openclean.operator.base.PipelineStage method)
is_group_reducer() (openclean.operator.base.PipelineStage method)
is_group_transformer() (openclean.operator.base.PipelineStage method)
is_insert (openclean.engine.action.OpHandle property)
is_int() (in module openclean.function.value.datatype)
is_list_or_tuple() (in module openclean.util.core)
is_nan() (in module openclean.function.value.datatype)
is_none() (in module openclean.function.value.null)
is_not_empty() (in module openclean.function.value.null)
is_not_none() (in module openclean.function.value.null)
is_numeric() (in module openclean.function.value.datatype)
is_numeric_type() (in module openclean.function.value.datatype)
is_prepared() (openclean.embedding.feature.frequency.NormalizedFrequency method)
(openclean.embedding.feature.length.NormalizedLength method)
(openclean.function.eval.aggregate.ColumnAggregator method)
(openclean.function.value.base.PreparedFunction method)
(openclean.function.value.base.UnpreparedFunction method)
(openclean.function.value.base.ValueFunction method)
(openclean.function.value.classifier.ClassLabel method)
(openclean.function.value.classifier.ValueClassifier method)
(openclean.function.value.normalize.numeric.DivideByTotal method)
(openclean.function.value.normalize.numeric.MaxAbsScale method)
(openclean.function.value.normalize.numeric.MinMaxScale method)
(openclean.function.value.picker.ValuePicker method)
is_satisfied() (openclean.function.similarity.base.SimilarityConstraint method)
is_single_or_dict() (in module openclean.operator.collector.aggregate)
is_update (openclean.engine.action.OpHandle property)
IsDatetime (class in openclean.function.eval.datatype)
IsEmpty (class in openclean.function.eval.null)
IsFloat (class in openclean.function.eval.datatype)
IsIn (class in openclean.function.eval.domain)
IsInDomain (class in openclean.function.value.domain)
IsInt (class in openclean.function.eval.datatype)
IsMatch (class in openclean.function.eval.regex)
(class in openclean.function.value.regex)
IsNaN (class in openclean.function.eval.datatype)
IsNotEmpty (class in openclean.function.eval.null)
IsNotIn (class in openclean.function.eval.domain)
IsNotInDomain (class in openclean.function.value.domain)
IsNotMatch (class in openclean.function.eval.regex)
(class in openclean.function.value.regex)
isolation_forest() (in module openclean.profiling.anomalies.sklearn)
items() (openclean.data.groupby.DataFrameGrouping method)
iterrows() (openclean.pipeline.DataPipeline method)
J
JaroSimilarity (class in openclean.function.similarity.text)
JaroWinklerSimilarity (class in openclean.function.similarity.text)
K
KEY() (in module openclean.data.metadata.mem)
key_collision() (in module openclean.cluster.key)
key_violations() (in module openclean.operator.map.violations)
KeyCollision (class in openclean.cluster.key)
KeyCollisionCluster (class in openclean.cluster.key)
keys() (openclean.data.groupby.DataFrameGrouping method)
KeyValueGenerator (class in openclean.cluster.key)
knn_clusters() (in module openclean.cluster.knn)
knn_collision_clusters() (in module openclean.cluster.knn)
kNNClusterer (class in openclean.cluster.knn)
L
label (openclean.engine.object.base.ObjectHandle attribute)
last_version() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.engine.log.OperationLog method)
le() (in module openclean.util.threshold)
Least (class in openclean.function.eval.row)
Length (class in openclean.function.eval.text)
Leq (class in openclean.function.eval.base)
letters_count() (in module openclean.embedding.feature.character)
letters_fraction() (in module openclean.embedding.feature.character)
LevenshteinDistance (class in openclean.function.similarity.text)
Limit (class in openclean.operator.transform.limit)
limit() (in module openclean.operator.transform.limit)
(openclean.pipeline.DataPipeline method)
LimitConsumer (class in openclean.operator.transform.limit)
List (class in openclean.function.eval.list)
list() (in module openclean.data.refdata)
list_annotations() (openclean.data.metadata.base.MetadataStore method)
load() (in module openclean.data.refdata)
(openclean.data.source.socrata.SODADataset method)
load_dataset() (openclean.engine.base.OpencleanEngine method)
LoadOp (class in openclean.engine.action)
local_outlier_factor() (in module openclean.profiling.anomalies.sklearn)
log() (openclean.engine.dataset.DatasetHandle method)
LogEntry (class in openclean.engine.log)
Longest (class in openclean.function.value.aggregate)
Lookup (class in openclean.function.eval.domain)
(class in openclean.function.value.mapping)
lookup() (openclean.engine.library.ObjectLibrary method)
lookups() (openclean.engine.library.ObjectLibrary method)
Lower (class in openclean.function.eval.text)
LowerOrEqual (class in openclean.function.value.threshold)
LowerThan (class in openclean.function.value.threshold)
LowerTokens (class in openclean.function.token.base)
Lt (class in openclean.function.eval.base)
lt() (in module openclean.util.threshold)
M
majority_typepicker() (in module openclean.profiling.classifier.typepicker)
MajorityTypePicker (class in openclean.profiling.classifier.typepicker)
MajorityVote (class in openclean.function.value.picker)
(class in openclean.function.value.vote)
manager() (in module openclean.data.archive.base)
map() (openclean.function.value.base.ValueFunction method)
(openclean.operator.base.DataFrameMapper method)
(openclean.operator.map.groupby.GroupBy method)
(openclean.operator.map.violations.Violations method)
Mapping (class in openclean.data.mapping)
mapping() (in module openclean.function.value.mapping)
MappingFactory (class in openclean.engine.object.mapping)
MappingHandle (class in openclean.engine.object.mapping)
match() (openclean.function.matching.base.ExactSimilarity method)
(openclean.function.matching.base.StringSimilarity method)
(openclean.function.matching.fuzzy.FuzzySimilarity method)
(openclean.function.matching.tests.DummyMatcher method)
(openclean.pipeline.DataPipeline method)
match_counts() (openclean.data.mapping.Mapping method)
matched() (openclean.data.mapping.Mapping method)
matched_values() (openclean.function.matching.base.StringMatcher method)
MatchRatingComparison (class in openclean.function.similarity.text)
Max (class in openclean.function.eval.aggregate)
(class in openclean.function.value.aggregate)
max_abs_scale() (in module openclean.function.value.normalize.numeric)
MaxAbsScale (class in openclean.function.eval.normalize)
(class in openclean.function.value.normalize.numeric)
maximum (openclean.profiling.stats.MinMaxCollector property)
merge() (in module openclean.function.value.base)
metadata() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.engine.base.OpencleanEngine method)
(openclean.engine.dataset.DatasetHandle method)
(openclean.profiling.pattern.base.Pattern method)
MetadataStore (class in openclean.data.metadata.base)
MetadataStoreFactory (class in openclean.data.metadata.base)
Metaphone (class in openclean.function.value.phonetic)
Min (class in openclean.function.eval.aggregate)
(class in openclean.function.value.aggregate)
min_max_scale() (in module openclean.function.value.normalize.numeric)
minimum (openclean.profiling.stats.MinMaxCollector property)
minmax() (openclean.profiling.dataset.DatasetProfile method)
MinMaxCollector (class in openclean.profiling.stats)
MinMaxFilter (class in openclean.function.token.filter)
MinMaxScale (class in openclean.function.eval.normalize)
(class in openclean.function.value.normalize.numeric)
module
openclean
openclean.cluster
openclean.cluster.base
openclean.cluster.index
openclean.cluster.key
openclean.cluster.knn
openclean.config
openclean.data
openclean.data.archive
openclean.data.archive.base
openclean.data.archive.cache
openclean.data.archive.histore
openclean.data.groupby
openclean.data.load
openclean.data.mapping
openclean.data.metadata
openclean.data.metadata.base
openclean.data.metadata.fs
openclean.data.metadata.mem
openclean.data.refdata
openclean.data.schema
openclean.data.sequence
openclean.data.serialize
openclean.data.source
openclean.data.source.socrata
openclean.data.store
openclean.data.store.base
openclean.data.store.fs
openclean.data.store.mem
openclean.data.stream
openclean.data.stream.base
openclean.data.stream.csv
openclean.data.stream.df
openclean.data.types
openclean.data.util
openclean.embedding
openclean.embedding.base
openclean.embedding.feature
openclean.embedding.feature.base
openclean.embedding.feature.character
openclean.embedding.feature.default
openclean.embedding.feature.frequency
openclean.embedding.feature.length
openclean.engine
openclean.engine.action
openclean.engine.base
openclean.engine.dataset
openclean.engine.library
openclean.engine.log
openclean.engine.object
openclean.engine.object.base
openclean.engine.object.function
openclean.engine.object.mapping
openclean.engine.object.vocabulary
openclean.engine.operator
openclean.engine.parallel
openclean.engine.registry
openclean.engine.store
openclean.engine.store.base
openclean.engine.store.default
openclean.engine.store.function
openclean.function
openclean.function.base
openclean.function.eval
openclean.function.eval.aggregate
openclean.function.eval.base
openclean.function.eval.datatype
openclean.function.eval.domain
openclean.function.eval.list
openclean.function.eval.logic
openclean.function.eval.mapping
openclean.function.eval.normalize
openclean.function.eval.null
openclean.function.eval.random
openclean.function.eval.regex
openclean.function.eval.row
openclean.function.eval.text
openclean.function.matching
openclean.function.matching.base
openclean.function.matching.fuzzy
openclean.function.matching.tests
openclean.function.similarity
openclean.function.similarity.base
openclean.function.similarity.text
openclean.function.token
openclean.function.token.base
openclean.function.token.convert
openclean.function.token.filter
openclean.function.token.ngram
openclean.function.token.split
openclean.function.value
openclean.function.value.aggregate
openclean.function.value.base
openclean.function.value.classifier
openclean.function.value.cond
openclean.function.value.datatype
openclean.function.value.domain
openclean.function.value.filter
openclean.function.value.key
openclean.function.value.key.fingerprint
openclean.function.value.mapping
openclean.function.value.normalize
openclean.function.value.normalize.numeric
openclean.function.value.normalize.text
openclean.function.value.null
openclean.function.value.phonetic
openclean.function.value.picker
openclean.function.value.random
openclean.function.value.regex
openclean.function.value.text
openclean.function.value.threshold
openclean.function.value.vote
openclean.operator
openclean.operator.base
openclean.operator.collector
openclean.operator.collector.aggregate
openclean.operator.collector.count
openclean.operator.collector.repair
openclean.operator.map
openclean.operator.map.groupby
openclean.operator.map.violations
openclean.operator.split
openclean.operator.split.split
openclean.operator.stream
openclean.operator.stream.collector
openclean.operator.stream.consumer
openclean.operator.stream.matching
openclean.operator.stream.processor
openclean.operator.stream.sample
openclean.operator.transform
openclean.operator.transform.apply
openclean.operator.transform.filter
openclean.operator.transform.insert
openclean.operator.transform.limit
openclean.operator.transform.move
openclean.operator.transform.rename
openclean.operator.transform.select
openclean.operator.transform.sort
openclean.operator.transform.update
openclean.pipeline
openclean.profiling
openclean.profiling.anomalies
openclean.profiling.anomalies.base
openclean.profiling.anomalies.conditional
openclean.profiling.anomalies.datatype
openclean.profiling.anomalies.domain
openclean.profiling.anomalies.frequency
openclean.profiling.anomalies.pattern
openclean.profiling.anomalies.sklearn
openclean.profiling.base
openclean.profiling.classifier
openclean.profiling.classifier.base
openclean.profiling.classifier.datatype
openclean.profiling.classifier.typepicker
openclean.profiling.column
openclean.profiling.constraints
openclean.profiling.constraints.fd
openclean.profiling.constraints.ucc
openclean.profiling.dataset
openclean.profiling.datatype
openclean.profiling.datatype.convert
openclean.profiling.datatype.operator
openclean.profiling.pattern
openclean.profiling.pattern.base
openclean.profiling.pattern.token_signature
openclean.profiling.stats
openclean.profiling.tests
openclean.util
openclean.util.core
openclean.util.threshold
openclean.version
most_common() (openclean.data.groupby.ConflictSummary method)
move() (openclean.pipeline.DataPipeline method)
move_rows() (in module openclean.operator.transform.move)
MoveCols (class in openclean.operator.transform.move)
movecols() (in module openclean.operator.transform.move)
MoveRows (class in openclean.operator.transform.move)
multi_column_iterator() (in module openclean.data.sequence)
Multiply (class in openclean.function.eval.base)
multitype_columns() (openclean.profiling.dataset.DatasetProfile method)
N
name (openclean.engine.object.base.ObjectHandle attribute)
(openclean.engine.object.function.FunctionHandle attribute)
(openclean.engine.object.mapping.MappingHandle attribute)
(openclean.engine.object.vocabulary.VocabularyHandle attribute)
(openclean.engine.store.default.StoredObject attribute)
names (openclean.engine.action.InsertOp property)
namespace (openclean.engine.object.base.ObjectHandle attribute)
Neq (class in openclean.function.eval.base)
next() (openclean.pipeline.PipelineIterator method)
NGramFingerprint (class in openclean.function.value.key.fingerprint)
NGrams (class in openclean.function.token.ngram)
Node (class in openclean.cluster.index)
NoMatch (class in openclean.data.mapping)
NONDIACRITICS (in module openclean.function.value.normalize.text)
Normalize (class in openclean.function.eval.normalize)
normalize() (in module openclean.function.value.base)
NormalizedEditDistance (class in openclean.function.similarity.text)
NormalizedFrequency (class in openclean.embedding.feature.frequency)
NormalizedLength (class in openclean.embedding.feature.length)
Not (class in openclean.function.eval.logic)
NumericNormalizer (class in openclean.function.value.normalize.numeric)
NYSIIS (class in openclean.function.value.phonetic)
O
object_id (openclean.engine.store.default.StoredObject attribute)
ObjectFactory (class in openclean.engine.object.base)
ObjectHandle (class in openclean.engine.object.base)
ObjectLibrary (class in openclean.engine.library)
ObjectStore (class in openclean.engine.store.base)
ONE (class in openclean.cluster.base)
One (class in openclean.function.base)
one_class_svm() (in module openclean.profiling.anomalies.sklearn)
OnlyOneValue (class in openclean.function.value.picker)
open() (in module openclean.data.refdata)
(openclean.cluster.base.Clusterer method)
(openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.engine.dataset.DatasetHandle method)
(openclean.engine.operator.StreamOp method)
(openclean.operator.stream.collector.Collector method)
(openclean.operator.stream.collector.DataFrame method)
(openclean.operator.stream.collector.Distinct method)
(openclean.operator.stream.collector.RowCount method)
(openclean.operator.stream.collector.Write method)
(openclean.operator.stream.matching.BestMatches method)
(openclean.operator.stream.processor.StreamProcessor method)
(openclean.operator.stream.sample.Sample method)
(openclean.operator.transform.filter.Filter method)
(openclean.operator.transform.insert.InsCol method)
(openclean.operator.transform.limit.Limit method)
(openclean.operator.transform.move.MoveCols method)
(openclean.operator.transform.rename.Rename method)
(openclean.operator.transform.select.Select method)
(openclean.operator.transform.update.Update method)
(openclean.pipeline.DataPipeline method)
(openclean.profiling.base.DataProfiler method)
(openclean.profiling.base.DistinctSetProfiler method)
(openclean.profiling.classifier.base.Classifier method)
(openclean.profiling.column.DefaultStreamProfiler method)
(openclean.profiling.dataset.ProfileOperator method)
(openclean.profiling.datatype.operator.Typecast method)
(openclean.profiling.tests.ValueCounter method)
openclean
module
openclean.cluster
module
openclean.cluster.base
module
openclean.cluster.index
module
openclean.cluster.key
module
openclean.cluster.knn
module
openclean.config
module
openclean.data
module
openclean.data.archive
module
openclean.data.archive.base
module
openclean.data.archive.cache
module
openclean.data.archive.histore
module
openclean.data.groupby
module
openclean.data.load
module
openclean.data.mapping
module
openclean.data.metadata
module
openclean.data.metadata.base
module
openclean.data.metadata.fs
module
openclean.data.metadata.mem
module
openclean.data.refdata
module
openclean.data.schema
module
openclean.data.sequence
module
openclean.data.serialize
module
openclean.data.source
module
openclean.data.source.socrata
module
openclean.data.store
module
openclean.data.store.base
module
openclean.data.store.fs
module
openclean.data.store.mem
module
openclean.data.stream
module
openclean.data.stream.base
module
openclean.data.stream.csv
module
openclean.data.stream.df
module
openclean.data.types
module
openclean.data.util
module
openclean.embedding
module
openclean.embedding.base
module
openclean.embedding.feature
module
openclean.embedding.feature.base
module
openclean.embedding.feature.character
module
openclean.embedding.feature.default
module
openclean.embedding.feature.frequency
module
openclean.embedding.feature.length
module
openclean.engine
module
openclean.engine.action
module
openclean.engine.base
module
openclean.engine.dataset
module
openclean.engine.library
module
openclean.engine.log
module
openclean.engine.object
module
openclean.engine.object.base
module
openclean.engine.object.function
module
openclean.engine.object.mapping
module
openclean.engine.object.vocabulary
module
openclean.engine.operator
module
openclean.engine.parallel
module
openclean.engine.registry
module
openclean.engine.store
module
openclean.engine.store.base
module
openclean.engine.store.default
module
openclean.engine.store.function
module
openclean.function
module
openclean.function.base
module
openclean.function.eval
module
openclean.function.eval.aggregate
module
openclean.function.eval.base
module
openclean.function.eval.datatype
module
openclean.function.eval.domain
module
openclean.function.eval.list
module
openclean.function.eval.logic
module
openclean.function.eval.mapping
module
openclean.function.eval.normalize
module
openclean.function.eval.null
module
openclean.function.eval.random
module
openclean.function.eval.regex
module
openclean.function.eval.row
module
openclean.function.eval.text
module
openclean.function.matching
module
openclean.function.matching.base
module
openclean.function.matching.fuzzy
module
openclean.function.matching.tests
module
openclean.function.similarity
module
openclean.function.similarity.base
module
openclean.function.similarity.text
module
openclean.function.token
module
openclean.function.token.base
module
openclean.function.token.convert
module
openclean.function.token.filter
module
openclean.function.token.ngram
module
openclean.function.token.split
module
openclean.function.value
module
openclean.function.value.aggregate
module
openclean.function.value.base
module
openclean.function.value.classifier
module
openclean.function.value.cond
module
openclean.function.value.datatype
module
openclean.function.value.domain
module
openclean.function.value.filter
module
openclean.function.value.key
module
openclean.function.value.key.fingerprint
module
openclean.function.value.mapping
module
openclean.function.value.normalize
module
openclean.function.value.normalize.numeric
module
openclean.function.value.normalize.text
module
openclean.function.value.null
module
openclean.function.value.phonetic
module
openclean.function.value.picker
module
openclean.function.value.random
module
openclean.function.value.regex
module
openclean.function.value.text
module
openclean.function.value.threshold
module
openclean.function.value.vote
module
openclean.operator
module
openclean.operator.base
module
openclean.operator.collector
module
openclean.operator.collector.aggregate
module
openclean.operator.collector.count
module
openclean.operator.collector.repair
module
openclean.operator.map
module
openclean.operator.map.groupby
module
openclean.operator.map.violations
module
openclean.operator.split
module
openclean.operator.split.split
module
openclean.operator.stream
module
openclean.operator.stream.collector
module
openclean.operator.stream.consumer
module
openclean.operator.stream.matching
module
openclean.operator.stream.processor
module
openclean.operator.stream.sample
module
openclean.operator.transform
module
openclean.operator.transform.apply
module
openclean.operator.transform.filter
module
openclean.operator.transform.insert
module
openclean.operator.transform.limit
module
openclean.operator.transform.move
module
openclean.operator.transform.rename
module
openclean.operator.transform.select
module
openclean.operator.transform.sort
module
openclean.operator.transform.update
module
openclean.pipeline
module
openclean.profiling
module
openclean.profiling.anomalies
module
openclean.profiling.anomalies.base
module
openclean.profiling.anomalies.conditional
module
openclean.profiling.anomalies.datatype
module
openclean.profiling.anomalies.domain
module
openclean.profiling.anomalies.frequency
module
openclean.profiling.anomalies.pattern
module
openclean.profiling.anomalies.sklearn
module
openclean.profiling.base
module
openclean.profiling.classifier
module
openclean.profiling.classifier.base
module
openclean.profiling.classifier.datatype
module
openclean.profiling.classifier.typepicker
module
openclean.profiling.column
module
openclean.profiling.constraints
module
openclean.profiling.constraints.fd
module
openclean.profiling.constraints.ucc
module
openclean.profiling.dataset
module
openclean.profiling.datatype
module
openclean.profiling.datatype.convert
module
openclean.profiling.datatype.operator
module
openclean.profiling.pattern
module
openclean.profiling.pattern.base
module
openclean.profiling.pattern.token_signature
module
openclean.profiling.stats
module
openclean.profiling.tests
module
openclean.util
module
openclean.util.core
module
openclean.util.threshold
module
openclean.version
module
OpencleanEngine (class in openclean.engine.base)
OperationLog (class in openclean.engine.log)
OpHandle (class in openclean.engine.action)
Or (class in openclean.function.eval.logic)
order_by() (in module openclean.operator.transform.sort)
outlier() (openclean.profiling.anomalies.conditional.ConditionalOutliers method)
(openclean.profiling.anomalies.datatype.DatatypeOutliers method)
(openclean.profiling.anomalies.domain.DomainOutliers method)
(openclean.profiling.anomalies.pattern.RegExOutliers method)
(openclean.profiling.anomalies.pattern.TokenSignatureOutliers method)
P
Pattern (class in openclean.profiling.pattern.base)
pattern() (openclean.profiling.pattern.base.Pattern method)
PatternFinder (class in openclean.profiling.pattern.base)
persist() (openclean.pipeline.DataPipeline method)
PhoneticMatcher (class in openclean.function.value.phonetic)
pick() (openclean.function.value.picker.MajorityVote method)
(openclean.function.value.picker.OnlyOneValue method)
(openclean.function.value.picker.ValuePicker method)
PipelineIterator (class in openclean.pipeline)
PipelineStage (class in openclean.operator.base)
Pow (class in openclean.function.eval.base)
prepare() (openclean.embedding.base.ValueEmbedder method)
(openclean.embedding.feature.base.FeatureEmbedding method)
(openclean.embedding.feature.frequency.NormalizedFrequency method)
(openclean.embedding.feature.length.NormalizedLength method)
(openclean.function.eval.aggregate.ColumnAggregator method)
(openclean.function.eval.base.BinaryOperator method)
(openclean.function.eval.base.Col method)
(openclean.function.eval.base.Cols method)
(openclean.function.eval.base.Const method)
(openclean.function.eval.base.Eval method)
(openclean.function.eval.base.EvalFunction method)
(openclean.function.eval.domain.Lookup method)
(openclean.function.eval.random.Rand method)
(openclean.function.value.aggregate.ValueAggregator method)
(openclean.function.value.base.PreparedFunction method)
(openclean.function.value.base.ValueFunction method)
(openclean.function.value.classifier.ClassLabel method)
(openclean.function.value.classifier.ValueClassifier method)
(openclean.function.value.normalize.numeric.DivideByTotal method)
(openclean.function.value.normalize.numeric.MaxAbsScale method)
(openclean.function.value.normalize.numeric.MinMaxScale method)
(openclean.function.value.picker.ValuePicker method)
(openclean.function.value.random.RandomSelect method)
(openclean.function.value.vote.MajorityVote method)
(openclean.operator.collector.repair.ValueExtractor method)
PreparedFunction (class in openclean.function.value.base)
process() (openclean.operator.stream.consumer.StreamConsumer method)
(openclean.profiling.anomalies.conditional.ConditionalOutliers method)
(openclean.profiling.anomalies.frequency.FrequencyOutliers method)
(openclean.profiling.anomalies.sklearn.SklearnOutliers method)
(openclean.profiling.base.DataProfiler method)
(openclean.profiling.base.DataStreamProfiler method)
(openclean.profiling.classifier.typepicker.MajorityTypePicker method)
(openclean.profiling.classifier.typepicker.ThresholdTypePicker method)
(openclean.profiling.column.DefaultColumnProfiler method)
process_list() (in module openclean.engine.parallel)
ProducingConsumer (class in openclean.operator.stream.consumer)
profile() (openclean.pipeline.DataPipeline method)
(openclean.profiling.dataset.Profiler method)
ProfileConsumer (class in openclean.profiling.dataset)
ProfileOperator (class in openclean.profiling.dataset)
Profiler (class in openclean.profiling.dataset)
profiles() (openclean.profiling.dataset.DatasetProfile method)
R
Rand (class in openclean.function.eval.random)
RandomSelect (class in openclean.function.value.random)
read() (openclean.data.metadata.base.MetadataStore method)
(openclean.data.metadata.fs.FileSystemMetadataStore method)
(openclean.data.metadata.mem.VolatileMetadataStore method)
read_object() (openclean.data.store.base.DataStore method)
(openclean.data.store.fs.FileSystemJsonStore method)
(openclean.data.store.mem.VolatileDataStore method)
reduce() (openclean.operator.base.DataGroupReducer method)
(openclean.operator.collector.aggregate.Aggregate method)
(openclean.operator.collector.repair.ConflictRepair method)
RefStore (class in openclean.data.refdata)
regex_outliers() (in module openclean.profiling.anomalies.pattern)
regex_type (openclean.function.token.base.Token property)
RegExOutliers (class in openclean.profiling.anomalies.pattern)
register (openclean.engine.base.OpencleanEngine property)
remove() (in module openclean.data.refdata)
Rename (class in openclean.operator.transform.rename)
rename() (in module openclean.operator.transform.rename)
(openclean.operator.transform.rename.Rename method)
(openclean.pipeline.DataPipeline method)
reorder() (openclean.operator.transform.move.MoveCols method)
repair_mapping() (in module openclean.data.util)
RepeatedTokenFilter (class in openclean.function.token.filter)
replace() (in module openclean.function.value.mapping)
repository() (in module openclean.data.refdata)
ResultFeatures (class in openclean.profiling.classifier.base)
ReverseTokens (class in openclean.function.token.base)
robust_covariance() (in module openclean.profiling.anomalies.sklearn)
rollback() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
(openclean.data.metadata.base.MetadataStoreFactory method)
(openclean.data.metadata.fs.FileSystemMetadataStoreFactory method)
(openclean.data.metadata.mem.VolatileMetadataStoreFactory method)
(openclean.engine.base.OpencleanEngine method)
(openclean.engine.dataset.DatasetHandle method)
RowCount (class in openclean.operator.stream.collector)
rows() (openclean.data.groupby.DataFrameGrouping method)
run() (openclean.pipeline.DataPipeline method)
(openclean.profiling.base.DataProfiler method)
(openclean.profiling.constraints.fd.FunctionalDependencyFinder method)
(openclean.profiling.constraints.ucc.UniqueColumnCombinationFinder method)
S
Sample (class in openclean.operator.stream.sample)
sample() (openclean.engine.base.OpencleanEngine method)
(openclean.pipeline.DataPipeline method)
SampleCollector (class in openclean.operator.stream.sample)
SampleOp (class in openclean.engine.action)
scalar_pass_through() (in module openclean.util.core)
schema() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
score (openclean.data.mapping.ExactMatch attribute)
(openclean.data.mapping.NoMatch attribute)
(openclean.data.mapping.StringMatch attribute)
score() (openclean.function.matching.base.StringSimilarity method)
search() (openclean.function.matching.fuzzy.FuzzySimilarity method)
Select (class in openclean.operator.transform.select)
select() (in module openclean.operator.transform.select)
(openclean.operator.map.groupby.GroupBy static method)
(openclean.operator.map.violations.Violations static method)
(openclean.pipeline.DataPipeline method)
Sequence (class in openclean.data.sequence)
serialize() (openclean.engine.object.base.ObjectFactory method)
(openclean.engine.object.function.FunctionFactory method)
(openclean.engine.object.mapping.MappingFactory method)
(openclean.engine.object.vocabulary.VocabularyFactory method)
set_annotation() (openclean.data.metadata.base.MetadataStore method)
set_consumer() (openclean.operator.stream.consumer.ProducingConsumer method)
Shortest (class in openclean.function.value.aggregate)
sim() (openclean.function.similarity.base.SimilarityFunction method)
(openclean.function.similarity.text.MatchRatingComparison method)
(openclean.function.similarity.text.NormalizedEditDistance method)
(openclean.function.similarity.text.StringSimilarityFunction method)
SimilarityConstraint (class in openclean.function.similarity.base)
SimilarityFunction (class in openclean.function.similarity.base)
single_column_iterator() (in module openclean.data.sequence)
size (openclean.function.token.base.Token property)
SklearnOutliers (class in openclean.profiling.anomalies.sklearn)
snapshots() (openclean.data.archive.base.ArchiveStore method)
(openclean.data.archive.cache.CachedDatastore method)
(openclean.data.archive.histore.HISTOREDatastore method)
Socrata (class in openclean.data.source.socrata)
SODADataset (class in openclean.data.source.socrata)
Sort (class in openclean.operator.transform.sort)
SortTokens (class in openclean.function.token.base)
Soundex (class in openclean.function.value.phonetic)
spec_char_count() (in module openclean.embedding.feature.character)
spec_char_fraction() (in module openclean.embedding.feature.character)
Split (class in openclean.function.token.split)
(class in openclean.operator.split.split)
split() (in module openclean.operator.split.split)
(openclean.operator.base.DataFrameSplitter method)
(openclean.operator.split.split.Split method)
StandardEmbedding (class in openclean.embedding.feature.default)
Standardize (class in openclean.function.eval.mapping)
(class in openclean.function.value.mapping)
StandardizeTokens (class in openclean.function.token.base)
StartsWith (class in openclean.function.eval.text)
stats() (openclean.profiling.dataset.DatasetProfile method)
store() (in module openclean.data.refdata)
StoredObject (class in openclean.engine.store.default)
Str (class in openclean.function.eval.datatype)
stream() (in module openclean.pipeline)
(openclean.engine.base.OpencleanEngine method)
(openclean.pipeline.DataPipeline method)
StreamClusterer (class in openclean.cluster.base)
StreamConsumer (class in openclean.operator.stream.consumer)
StreamFunctionHandler (class in openclean.operator.stream.consumer)
StreamOp (class in openclean.engine.operator)
StreamOperator (class in openclean.engine.operator)
StreamProcessor (class in openclean.operator.stream.processor)
StringFunction (class in openclean.function.eval.text)
StringMatch (class in openclean.data.mapping)
StringMatcher (class in openclean.function.matching.base)
StringSimilarity (class in openclean.function.matching.base)
StringSimilarityFunction (class in openclean.function.similarity.text)
Subtract (class in openclean.function.eval.base)
suggestion() (openclean.cluster.base.Cluster method)
Sum (class in openclean.function.eval.aggregate)
summarize_conflicts() (openclean.data.groupby.DataFrameViolation method)
swap() (in module openclean.operator.transform.update)
T
tenary_pass_through() (in module openclean.util.core)
term (openclean.data.mapping.ExactMatch attribute)
(openclean.data.mapping.NoMatch attribute)
(openclean.data.mapping.StringMatch attribute)
TernaryStreamFunction (class in openclean.function.eval.base)
TextNormalizer (class in openclean.function.value.normalize.text)
THREADS() (in module openclean.config)
Threshold (class in openclean.util.threshold)
threshold_typepicker() (in module openclean.profiling.classifier.typepicker)
ThresholdPredicate (class in openclean.function.value.threshold)
ThresholdTypePicker (class in openclean.profiling.classifier.typepicker)
to_column_eval() (in module openclean.function.eval.base)
to_const_eval() (in module openclean.function.eval.base)
to_datetime() (in module openclean.function.value.datatype)
to_datetime_format() (in module openclean.function.value.datatype)
to_df() (openclean.pipeline.DataPipeline method)
to_dict() (openclean.data.archive.base.ActionHandle method)
(openclean.engine.action.InsertOp method)
(openclean.engine.action.OpHandle method)
(openclean.engine.object.base.ObjectHandle method)
(openclean.engine.store.default.StoredObject method)
(openclean.profiling.pattern.base.Pattern method)
to_eval() (in module openclean.function.eval.base)
(openclean.engine.action.CommitOp method)
(openclean.engine.action.InsertOp method)
(openclean.engine.action.LoadOp method)
(openclean.engine.action.OpHandle method)
(openclean.engine.action.SampleOp method)
(openclean.engine.action.UpdateOp method)
to_float() (in module openclean.function.value.datatype)
to_int() (in module openclean.function.value.datatype)
to_len() (in module openclean.function.value.text)
to_listing() (openclean.engine.store.base.ObjectStore method)
(openclean.engine.store.default.DefaultObjectStore method)
to_lookup() (openclean.data.mapping.Mapping method)
to_lower() (in module openclean.function.value.domain)
(in module openclean.function.value.text)
to_mapping() (openclean.cluster.base.Cluster method)
to_set() (in module openclean.data.util)
to_string() (in module openclean.function.value.datatype)
to_threshold() (in module openclean.util.threshold)
to_title() (in module openclean.function.value.text)
to_tuple() (openclean.function.token.base.Token method)
to_upper() (in module openclean.function.value.text)
to_value_function() (in module openclean.function.value.base)
Token (class in openclean.function.token.base)
token_signature() (in module openclean.profiling.pattern.token_signature)
TokenConverter (class in openclean.function.token.convert)
TokenFilter (class in openclean.function.token.filter)
Tokenizer (class in openclean.function.token.base)
TokenListConverter (class in openclean.function.token.convert)
TokenMapper (class in openclean.function.token.convert)
TokenPrefix (class in openclean.function.token.base)
Tokens (class in openclean.function.token.base)
tokens() (openclean.function.token.base.Tokenizer method)
(openclean.function.token.base.Tokens method)
(openclean.function.token.ngram.NGrams method)
(openclean.function.token.split.ChartypeSplit method)
(openclean.function.token.split.Split method)
TokenSignatureOutliers (class in openclean.profiling.anomalies.pattern)
TokenTransformer (class in openclean.function.token.base)
TokenTransformerPipeline (class in openclean.function.token.base)
TokenTypeFilter (class in openclean.function.token.filter)
TOTAL (openclean.profiling.classifier.base.ResultFeatures attribute)
transform() (openclean.function.token.base.ReverseTokens method)
(openclean.function.token.base.SortTokens method)
(openclean.function.token.base.TokenPrefix method)
(openclean.function.token.base.TokenTransformer method)
(openclean.function.token.base.TokenTransformerPipeline method)
(openclean.function.token.base.UniqueTokens method)
(openclean.function.token.base.UpdateTokens method)
(openclean.function.token.convert.TokenConverter method)
(openclean.function.token.convert.TokenListConverter method)
(openclean.function.token.filter.FirstLastFilter method)
(openclean.function.token.filter.RepeatedTokenFilter method)
(openclean.function.token.filter.TokenFilter method)
(openclean.function.token.filter.TokenTypeFilter method)
(openclean.operator.base.DataFrameTransformer method)
(openclean.operator.base.DataGroupTransformer method)
(openclean.operator.transform.apply.Apply method)
(openclean.operator.transform.filter.Filter method)
(openclean.operator.transform.insert.InsCol method)
(openclean.operator.transform.insert.InsRow method)
(openclean.operator.transform.limit.Limit method)
(openclean.operator.transform.move.MoveCols method)
(openclean.operator.transform.move.MoveRows method)
(openclean.operator.transform.rename.Rename method)
(openclean.operator.transform.select.Select method)
(openclean.operator.transform.sort.Sort method)
(openclean.operator.transform.update.Update method)
truncate() (openclean.engine.log.OperationLog method)
type() (openclean.function.token.base.Token method)
Typecast (class in openclean.profiling.datatype.operator)
typecast() (openclean.pipeline.DataPipeline method)
types() (openclean.profiling.anomalies.datatype.DatatypeOutlierResults method)
(openclean.profiling.dataset.DatasetProfile method)
U
UnaryStreamFunction (class in openclean.function.eval.base)
unique_columns() (openclean.profiling.dataset.DatasetProfile method)
unique_count() (in module openclean.embedding.feature.character)
unique_fraction() (in module openclean.embedding.feature.character)
unique_identifier() (in module openclean.util.core)
UniqueColumnCombinationFinder (class in openclean.profiling.constraints.ucc)
UniqueSetEmbedding (class in openclean.embedding.feature.default)
UniqueTokens (class in openclean.function.token.base)
unmatched() (openclean.data.mapping.Mapping method)
UnpreparedFunction (class in openclean.function.value.base)
Update (class in openclean.operator.transform.update)
update() (in module openclean.operator.transform.update)
(openclean.data.mapping.Mapping method)
(openclean.engine.dataset.DatasetHandle method)
(openclean.pipeline.DataPipeline method)
UpdateOp (class in openclean.engine.action)
UpdateTokens (class in openclean.function.token.base)
Upper (class in openclean.function.eval.text)
UpperTokens (class in openclean.function.token.base)
V
value (openclean.function.token.base.Token property)
ValueAggregator (class in openclean.function.value.aggregate)
ValueClassifier (class in openclean.function.value.classifier)
ValueConflicts (class in openclean.data.groupby)
ValueCounter (class in openclean.profiling.tests)
ValueEmbedder (class in openclean.embedding.base)
ValueExtractor (class in openclean.operator.collector.repair)
ValueFunction (class in openclean.function.value.base)
ValuePicker (class in openclean.function.value.picker)
values (openclean.data.groupby.ValueConflicts attribute)
values() (openclean.data.groupby.DataFrameGrouping method)
(openclean.profiling.anomalies.datatype.DatatypeOutlierResults method)
(openclean.profiling.anomalies.frequency.FrequencyOutlierResults method)
version (openclean.data.archive.cache.CacheEntry attribute)
(openclean.engine.log.LogEntry attribute)
version() (openclean.engine.dataset.DatasetHandle method)
Violations (class in openclean.operator.map.violations)
vocabularies() (openclean.engine.library.ObjectLibrary method)
vocabulary() (openclean.engine.library.ObjectLibrary method)
VocabularyFactory (class in openclean.engine.object.vocabulary)
VocabularyHandle (class in openclean.engine.object.vocabulary)
VolatileDataStore (class in openclean.data.store.mem)
VolatileMetadataStore (class in openclean.data.metadata.mem)
VolatileMetadataStoreFactory (class in openclean.data.metadata.mem)
W
where() (openclean.pipeline.DataPipeline method)
whitespace_count() (in module openclean.embedding.feature.character)
whitespace_fraction() (in module openclean.embedding.feature.character)
Write (class in openclean.operator.stream.collector)
write() (openclean.data.metadata.base.MetadataStore method)
(openclean.data.metadata.fs.FileSystemMetadataStore method)
(openclean.data.metadata.mem.VolatileMetadataStore method)
(openclean.data.source.socrata.SODADataset method)
(openclean.pipeline.DataPipeline method)
write_object() (openclean.data.store.base.DataStore method)
(openclean.data.store.fs.FileSystemJsonStore method)
(openclean.data.store.mem.VolatileDataStore method)
Read the Docs
v: latest
Versions
latest
stable
Downloads
html
On Read the Docs
Project Home
Builds