import pandas as pd
import woodwork as ww
from sklearn.datasets import load_breast_cancer as load_breast_cancer_sk
[docs]def load_breast_cancer(return_pandas=False):
"""Load breast cancer dataset. Binary classification problem.
Returns:
Union[(ww.DataTable, ww.DataColumn), (pd.Dataframe, pd.Series)]: X and y
"""
data = load_breast_cancer_sk()
X = pd.DataFrame(data.data, columns=data.feature_names)
y = pd.Series(data.target)
y = y.map(lambda x: data["target_names"][x])
if return_pandas:
return X, y
return ww.DataTable(X), ww.DataColumn(y)