hookeai.utilities.fit_data_scalers.mean_std_batch_fit

mean_std_batch_fit(dataset, features_type, n_features, is_verbose=False)[source]

Perform batch fitting of standardization data scaler.

Parameters:
  • dataset (torch.utils.data.Dataset) – Time series data set. Each sample is stored as a dictionary where each feature (key, str) data is a torch.Tensor(2d) of shape (sequence_length, n_features).

  • features_type (str) – Features for which data scaler is fitted (e.g., ‘features_in’, ‘features_out’). Must be directly available from data set samples.

  • n_features (int) – Number of features (dimensionality).

  • is_verbose (bool, default=False) – If True, enable verbose output.

Returns:

  • mean (torch.Tensor) – Features standardization mean tensor stored as a torch.Tensor with shape (n_features,).

  • std (torch.Tensor) – Features standardization standard deviation tensor stored as a torch.Tensor with shape (n_features,).

Notes

A biased estimator is used to compute the standard deviation according with scikit-learn 1.3.2 documentation (sklearn.preprocessing.StandardScaler).