dask_engine¶
Module Contents¶
Classes Summary¶
A Future-like wrapper around jobs created by the DaskEngine. |
|
The dask engine |
Contents¶
-
class
evalml.automl.engine.dask_engine.
DaskComputation
(dask_future)[source]¶ A Future-like wrapper around jobs created by the DaskEngine.
- Parameters
dask_future (callable) – Computation to do.
Methods
Cancel the current computation.
- returns
Whether the computation is done.
Gets the computation result.
- returns
Returns whether computation was cancelled.
-
get_result
(self)[source]¶ Gets the computation result. Will block until the computation is finished.
- Raises
Exception – If computation fails. Returns traceback.
-
property
is_cancelled
(self)¶ - Returns
Returns whether computation was cancelled.
- Return type
bool
-
class
evalml.automl.engine.dask_engine.
DaskEngine
(client)[source]¶ The dask engine
Methods
Send data to the cluster.
Send evaluation job to cluster.
Send scoring job to cluster.
Send training job to cluster.
-
send_data_to_cluster
(self, X, y)[source]¶ Send data to the cluster.
The implementation uses caching so the data is only sent once. This follows dask best practices.
- Parameters
X (pd.DataFrame) – input data for modeling
y (pd.Series) – target data for modeling
- Returns
the modeling data
- Return type
dask.Future
-
static
setup_job_log
()¶
-
submit_evaluation_job
(self, automl_config, pipeline, X, y) → evalml.automl.engine.engine_base.EngineComputation[source]¶ Send evaluation job to cluster.
- Parameters
automl_config – structure containing data passed from AutoMLSearch instance
pipeline (pipeline.PipelineBase) – pipeline to evaluate
X (pd.DataFrame) – input data for modeling
y (pd.Series) – target data for modeling
- Returns
- a object wrapping a reference to a future-like computation
occurring in the dask cluster
- Return type
-
submit_scoring_job
(self, automl_config, pipeline, X, y, objectives) → evalml.automl.engine.engine_base.EngineComputation[source]¶ Send scoring job to cluster.
- Parameters
automl_config – structure containing data passed from AutoMLSearch instance
pipeline (pipeline.PipelineBase) – pipeline to train
X (pd.DataFrame) – input data for modeling
y (pd.Series) – target data for modeling
- Returns
- a object wrapping a reference to a future-like computation
occurring in the dask cluster
- Return type
-
submit_training_job
(self, automl_config, pipeline, X, y) → evalml.automl.engine.engine_base.EngineComputation[source]¶ Send training job to cluster.
- Parameters
automl_config – structure containing data passed from AutoMLSearch instance
pipeline (pipeline.PipelineBase) – pipeline to train
X (pd.DataFrame) – input data for modeling
y (pd.Series) – target data for modeling
- Returns
- a object wrapping a reference to a future-like computation
occurring in the dask cluster
- Return type
-