openclean.embedding.feature.base module

Value embedder for a list of feature functions.

class openclean.embedding.feature.base.FeatureEmbedding(features)

Bases: openclean.embedding.base.ValueEmbedder

Value embedder that uses a list of feature generating functions to create a vector for scalar input values.

embed(value)

Return the embedding vector for a given scalar value.

Parameters

value (scalar) – Scalar value (or tuple) in a data stream.

Return type

numpy.array

prepare(values)

Passes the list of values to the vector generator pre-compute any statistics (e.g., min-max values) that are required. Returns a (modified) instance of the feature generator.

Parameters

values (list) – List of data values.