whylogs.core.preprocessing#

Module Contents#

Classes#

ColumnProperties

Generic enumeration.

ListView

NumpyView

PandasView

PreprocessedColumn

View of a column with data of various underlying storage.

Attributes#

whylogs.core.preprocessing.logger#
class whylogs.core.preprocessing.ColumnProperties#

Bases: enum.Enum

Generic enumeration.

Derive from this class to define new enumerations.

default = 0#
homogeneous = 1#
name()#

The name of the Enum member.

value()#

The value of the Enum member.

class whylogs.core.preprocessing.ListView#
ints: Optional[List[int]]#
floats: Optional[List[Union[float, decimal.Decimal]]]#
strings: Optional[List[str]]#
tensors: Optional[List[whylogs.core.stubs.np.ndarray]]#
objs: Optional[List[Any]]#
iterables() List[List[Any]]#
Return type

List[List[Any]]

class whylogs.core.preprocessing.NumpyView#
property len: int#
Return type

int

ints: Optional[whylogs.core.stubs.np.ndarray]#
floats: Optional[whylogs.core.stubs.np.ndarray]#
strings: Optional[whylogs.core.stubs.np.ndarray]#
iterables() List[whylogs.core.stubs.np.ndarray]#
Return type

List[whylogs.core.stubs.np.ndarray]

class whylogs.core.preprocessing.PandasView#
strings: Optional[whylogs.core.stubs.pd.Series]#
tensors: Optional[whylogs.core.stubs.pd.Series]#
objs: Optional[whylogs.core.stubs.pd.Series]#
iterables() List[whylogs.core.stubs.pd.Series]#
Return type

List[whylogs.core.stubs.pd.Series]

class whylogs.core.preprocessing.PreprocessedColumn#

View of a column with data of various underlying storage.

If Pandas is available, we will use Pandas to handle batch processing. If numpy is available, we will use ndarray for numerical values. Otherwise, we preprocess values into typed lists for downstream consumers. We also track the null count and ensure that processed lists/Series don’t contain null values.

numpy: NumpyView#
pandas: PandasView#
list: ListView#
null_count: int = 0#
len: int = 0#
original: Any#
raw_iterator() Iterator[Any]#
Return type

Iterator[Any]

static apply(data: Any) PreprocessedColumn#
Parameters

data (Any) –

Return type

PreprocessedColumn