energy_fault_detector.data_preprocessing.column_selector

class ColumnSelector(max_nan_frac_per_col=0.05, features_to_exclude=None)

Class for selecting columns, using the provided list of features to exclude/drop and the fraction of NaNs.

Parameters:

max_nan_frac_per_col (float) – maximum fraction of NaN values allowed per column. Defaults to 0.05. If the fraction exceeds max_nan_frac_per_col, the column is dropped.
features_to_exclude (List[str]) – list of features that should be dropped. Defaults to None.

fit(x, y=None)

Find columns to keep for training

Parameters:

Return type:

ColumnSelector

get_feature_names_out(input_features=None)

Returns the list of feature names in the output.

inverse_transform(x)

Inverse transform does nothing in case of column selector - since the columns dropped are not reconstructed.

transform(x)

Drop columns from dataframe x.