data_check

Base class for all data checks.

Module Contents

Classes Summary

DataCheck – Base class for all data checks.

Contents

class evalml.data_checks.data_check.DataCheck

Base class for all data checks.

Data checks are a set of heuristics used to determine if there are problems with input data.

Methods

name – Return a name describing the data check.

validate – Inspect and validate the input data, run any necessary calculations or algorithms, and return a list of warnings and errors if applicable.

name(cls)

Return a name describing the data check.

abstract validate(self, X, y=None)

Inspect and validate the input data, run any necessary calculations or algorithms, and return a list of warnings and errors if applicable.

Parameters
  • X (pd.DataFrame) – The input data of shape [n_samples, n_features]

  • y (pd.Series, optional) – The target data of length [n_samples]

Returns

Dictionary of DataCheckError and DataCheckWarning messages

Return type

dict (DataCheckMessage)
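
Example

The interface above can be illustrated with a minimal sketch. It does not import evalml: SketchDataCheck is a hypothetical stand-in that mirrors the documented shape (a class-level name and an abstract validate), and MissingValueCheck is a made-up subclass. To stay dependency-free, X is a plain mapping of column name to list of values rather than a pd.DataFrame, name is a classmethod rather than whatever mechanism the library uses, and the "warnings"/"errors" layout of the returned dictionary is an assumption, not a confirmed format.

```python
from abc import ABC, abstractmethod


class SketchDataCheck(ABC):
    """Hypothetical stand-in mirroring the documented DataCheck interface."""

    @classmethod
    def name(cls):
        # Assumption: the descriptive name is simply the class name.
        return cls.__name__

    @abstractmethod
    def validate(self, X, y=None):
        """Inspect X (and optionally y) and return a dictionary of messages."""


class MissingValueCheck(SketchDataCheck):
    """Hypothetical check that flags columns containing None."""

    def validate(self, X, y=None):
        # X is a mapping of column name -> list of values, standing in
        # for a pd.DataFrame of shape [n_samples, n_features].
        results = {"warnings": [], "errors": []}  # assumed result layout
        for column, values in X.items():
            missing = sum(value is None for value in values)
            if missing:
                results["warnings"].append(
                    f"{self.name()}: column '{column}' has {missing} missing value(s)"
                )
        # Treat a missing target value as an error rather than a warning.
        if y is not None and any(value is None for value in y):
            results["errors"].append(
                f"{self.name()}: target contains missing values"
            )
        return results


X = {"age": [31, None, 47], "city": ["Oslo", "Lima", "Pune"]}
y = [0, 1, 1]
report = MissingValueCheck().validate(X, y)
print(report)
```

Because validate is abstract, instantiating the base class directly raises TypeError; only subclasses that implement validate can be created. Against the real library, a subclass would derive from evalml.data_checks.DataCheck and build its messages from DataCheckWarning and DataCheckError instead of plain strings.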