API Reference¶

Demo Datasets¶

`load_fraud`	Load credit card fraud dataset.
`load_wine`	Load wine dataset.
`load_breast_cancer`	Load breast cancer dataset.
`load_diabetes`	Load diabetes dataset.

`load_data`	Load features and labels from file(s).
`split_data`	Splits data into train and test sets.

`AutoClassifier`	Automatic pipeline search for classification problems
`AutoRegressor`	Automatic pipeline search for regression problems

List model type for a particular problem type

`get_pipelines`	Returns potential pipelines by model type
`save_pipeline`	Saves pipeline at file path
`load_pipeline`	Loads pipeline at file path
`RFClassificationPipeline`	Random Forest Pipeline for both binary and multiclass classification
`XGBoostPipeline`	XGBoost Pipeline for both binary and multiclass classification
`LogisticRegressionPipeline`	Logistic Regression Pipeline for both binary and multiclass classification
`RFRegressionPipeline`	Random Forest Pipeline for regression

`FraudCost`	Score the percentage of money lost of the total transaction amount process due to fraud
`LeadScoring`	Lead scoring

`F1`	F1 Score for binary classification
`F1Micro`	F1 Score for multiclass classification using micro averaging
`F1Macro`	F1 Score for multiclass classification using macro averaging
`F1Weighted`	F1 Score for multiclass classification using weighted averaging
`Precision`	Precision Score for binary classification
`PrecisionMicro`	Precision Score for multiclass classification using micro averaging
`PrecisionMacro`	Precision Score for multiclass classification using macro averaging
`PrecisionWeighted`	Precision Score for multiclass classification using weighted averaging
`Recall`	Recall Score for binary classification
`RecallMicro`	Recall Score for multiclass classification using micro averaging
`RecallMacro`	Recall Score for multiclass classification using macro averaging
`RecallWeighted`	Recall Score for multiclass classification using weighted averaging
`AUC`	AUC Score for binary classification
`AUCMicro`	AUC Score for multiclass classification using micro averaging
`AUCMacro`	AUC Score for multiclass classification using macro averaging
`AUCWeighted`	AUC Score for multiclass classification using weighted averaging
`LogLoss`	Log Loss for both binary and multiclass classification
`MCC`	Matthews correlation coefficient for both binary and multiclass classification

`R2`	Coefficient of determination for regression
`MAE`	Mean absolute error for regression
`MSE`	Mean squared error for regression
`MSLE`	Mean squared log error for regression
`MedianAE`	Median absolute error for regression
`MaxError`	Maximum residual error for regression
`ExpVariance`	Explained variance score for regression

`ProblemTypes`	Enum for type of machine learning problem: BINARY, MULTICLASS, or REGRESSION
`handle_problem_types`	Handles problem_type by either returning the ProblemTypes or converting from a str

Bayesian Optimizer

`detect_highly_null`	Checks if there are any highly-null columns in a dataframe.
`detect_label_leakage`	Check if any of the features are highly correlated with the target.
`detect_outliers`	Checks if there are any outliers in a dataframe by using first Isolation Forest to obtain the anomaly score of each index and then using IQR to determine score anomalies.
`detect_id_columns`	Check if any of the features are ID columns.