whylogs.datasets.employee
#
Module Contents#
Classes#
Employee Dataset |
|
Iterator to retrieve production batches, when multiple batches are required. |
Attributes#
- whylogs.datasets.employee.logger#
- whylogs.datasets.employee.base_config#
- class whylogs.datasets.employee.Employee(version: str = 'base')#
Bases:
whylogs.datasets.base.Dataset
Employee Dataset
- Parameters
version (str) –
- baseline_df: pandas.DataFrame#
- production_df: pandas.DataFrame#
- baseline_timestamp: Union[datetime.date, datetime.datetime]#
- production_start_timestamp: Union[datetime.date, datetime.datetime]#
- dataset_config: Optional[whylogs.datasets.configs.DatasetConfig]#
- classmethod config() whylogs.datasets.configs.DatasetConfig #
- Return type
- get_baseline() whylogs.datasets.base.Batch #
- Return type
- abstract get_inference_data(target_date: Optional[Union[datetime.date, datetime.datetime]] = None, number_batches: Optional[int] = None) Union[whylogs.datasets.base.Batch, Iterable[whylogs.datasets.base.Batch]] #
- Parameters
target_date (Optional[Union[datetime.date, datetime.datetime]]) –
number_batches (Optional[int]) –
- Return type
Union[whylogs.datasets.base.Batch, Iterable[whylogs.datasets.base.Batch]]
- get_production_data(target_date: Optional[Union[datetime.date, datetime.datetime]] = None, number_batches: Optional[int] = None) Union[whylogs.datasets.base.Batch, Iterable[whylogs.datasets.base.Batch]] #
Get batch(es) from production dataset.
- Parameters
target_date (Optional[Union[date, datetime]], optional) – Target date for single batch. If datetime is passed, only date will be considered, by default None
number_batches (Optional[int], optional) – Number of batches to be retrieved. Each batch will have a time interval as defined by production_interval from set_parameters. By default None
- Returns
Can return a single batch or an interator of batches, depending on input parameters
- Return type
- set_parameters(production_interval: Optional[str] = None, baseline_timestamp: Optional[Union[datetime.date, datetime.datetime]] = None, production_start_timestamp: Optional[Union[datetime.date, datetime.datetime]] = None, original: Optional[bool] = None) None #
Set timestamp and interval parameters for the dataset object.
- Parameters
production_interval (Optional[str], optional) – Interval for the production batches. If none is passed, daily production batches will be returned, by default None
baseline_timestamp (Optional[Union[date, datetime]], optional) – Timestamp for the baseline dataset. If none is passed, timestamp will be equal to the current day, by default None
production_start_timestamp (Optional[Union[date, datetime]], optional) – Timestamp for the start of the inference dataset. If none is passed, timestamp will be equal to tomorrow’s date, by default None
original (Optional[bool], optional) – _If true, sets both baseline and inference timestamps to the dataset’s original timestamp, by default None
- Return type
- class whylogs.datasets.employee.EmployeeDatasetIterator(df: pandas.DataFrame, number_days: int, number_batches: int, version: str, config=DatasetConfig)#
Iterator to retrieve production batches, when multiple batches are required.
- Parameters
df (pandas.DataFrame) –
number_days (int) –
number_batches (int) –
version (str) –