evalml.guardrails.detect_label_leakage

class evalml.guardrails.detect_label_leakage[source]

Check if any of the features are highly correlated with the target.

Currently only supports binary and numeric targets and features

Parameters
  • X (pd.DataFrame) – The input features to check

  • y (pd.Series) – the labels

  • threshold (float) – the correlation threshold to be considered leakage. Defaults to .95

Returns

leakage, dictionary of features with leakage and corresponding threshold