time_series_featurizer
=======================================================================================

.. py:module:: evalml.pipelines.components.transformers.preprocessing.time_series_featurizer

.. autoapi-nested-parse::

   Transformer that delays input features and target variable for time series problems.


Module Contents
---------------

Classes Summary
~~~~~~~~~~~~~~~

.. autoapisummary::

   evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer


Contents
~~~~~~~~~~~~~~~~~~~
.. py:class:: TimeSeriesFeaturizer(time_index=None, max_delay=2, gap=0, forecast_horizon=1, conf_level=0.05, rolling_window_size=0.25, delay_features=True, delay_target=True, random_seed=0, **kwargs)


   Transformer that delays input features and target variable for time series problems.

   This component uses an algorithm based on the autocorrelation values of the target variable
   to determine which lags to select from the set of all possible lags.

   The algorithm is based on the idea that the local maxima of the autocorrelation function indicate the lags that have
   the most impact on the present time.

   The algorithm computes the autocorrelation values and finds the local maxima, called "peaks", that are significant at the given
   conf_level. Since lags in the range [0, 10] tend to be predictive but not local maxima, the union of the peaks is taken
   with the significant lags in the range [0, 10]. At the end, only selected lags in the range [0, max_delay] are used.

   Parametrizing the algorithm by conf_level lets the AutoMLAlgorithm tune the set of lags chosen so that the chances
   of finding a good set of lags is higher.

   Using conf_level value of 1 selects all possible lags.

   :param time_index: Name of the column containing the datetime information used to order the data. Ignored.
   :type time_index: str
   :param max_delay: Maximum number of time units to delay each feature. Defaults to 2.
   :type max_delay: int
   :param forecast_horizon: The number of time periods the pipeline is expected to forecast.
   :type forecast_horizon: int
   :param conf_level: Float in range (0, 1] that determines the confidence interval size used to select
                      which lags to compute from the set of [1, max_delay]. A delay of 1 will always be computed. If 1,
                      selects all possible lags in the set of [1, max_delay], inclusive.
   :type conf_level: float
   :param rolling_window_size: Float in range (0, 1] that determines the size of the window used for rolling
                               features. Size is computed as rolling_window_size * max_delay.
   :type rolling_window_size: float
   :param delay_features: Whether to delay the input features. Defaults to True.
   :type delay_features: bool
   :param delay_target: Whether to delay the target. Defaults to True.
   :type delay_target: bool
   :param gap: The number of time units between when the features are collected and
               when the target is collected. For example, if you are predicting the next time step's target, gap=1.
               This is only needed because when gap=0, we need to be sure to start the lagging of the target variable
               at 1. Defaults to 1.
   :type gap: int
   :param random_seed: Seed for the random number generator. This transformer performs the same regardless of the random seed provided.
   :type random_seed: int


   **Attributes**

   .. list-table::
      :widths: 15 85
      :header-rows: 0

      * - **df_colname_prefix**
        - {}_delay_{}
      * - **hyperparameter_ranges**
        - Real(0.001, 1.0),    "rolling_window_size": Real(0.001, 1.0)}:type: {"conf_level"
      * - **modifies_features**
        - True
      * - **modifies_target**
        - False
      * - **name**
        - Time Series Featurizer
      * - **needs_fitting**
        - True
      * - **target_colname_prefix**
        - target_delay_{}
      * - **training_only**
        - False


   **Methods**

   .. autoapisummary::
      :nosignatures:

      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.clone
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.default_parameters
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.describe
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.fit
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.fit_transform
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.load
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.parameters
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.save
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.transform
      evalml.pipelines.components.transformers.preprocessing.time_series_featurizer.TimeSeriesFeaturizer.update_parameters

   .. py:method:: clone(self)

      Constructs a new component with the same parameters and random state.

      :returns: A new instance of this component with identical parameters and random state.


   .. py:method:: default_parameters(cls)

      Returns the default parameters for this component.

      Our convention is that Component.default_parameters == Component().parameters.

      :returns: Default parameters for this component.
      :rtype: dict


   .. py:method:: describe(self, print_name=False, return_dict=False)

      Describe a component and its parameters.

      :param print_name: whether to print name of component
      :type print_name: bool, optional
      :param return_dict: whether to return description as dictionary in the format {"name": name, "parameters": parameters}
      :type return_dict: bool, optional

      :returns: Returns dictionary if return_dict is True, else None.
      :rtype: None or dict


   .. py:method:: fit(self, X, y=None)

      Fits the DelayFeatureTransformer.

      :param X: The input training data of shape [n_samples, n_features]
      :type X: pd.DataFrame or np.ndarray
      :param y: The target training data of length [n_samples]
      :type y: pd.Series, optional

      :returns: self

      :raises ValueError: if self.time_index is None


   .. py:method:: fit_transform(self, X, y=None)

      Fit the component and transform the input data.

      :param X: Data to transform.
      :type X: pd.DataFrame
      :param y: Target.
      :type y: pd.Series, or None

      :returns: Transformed X.
      :rtype: pd.DataFrame


   .. py:method:: load(file_path)
      :staticmethod:

      Loads component at file path.

      :param file_path: Location to load file.
      :type file_path: str

      :returns: ComponentBase object


   .. py:method:: parameters(self)
      :property:

      Returns the parameters which were used to initialize the component.


   .. py:method:: save(self, file_path, pickle_protocol=cloudpickle.DEFAULT_PROTOCOL)

      Saves component at file path.

      :param file_path: Location to save file.
      :type file_path: str
      :param pickle_protocol: The pickle data stream format.
      :type pickle_protocol: int


   .. py:method:: transform(self, X, y=None)

      Computes the delayed values and rolling means for X and y.

      The chosen delays are determined by the autocorrelation function of the target variable. See the class docstring
      for more information on how they are chosen. If y is None, all possible lags are chosen.

      If y is not None, it will also compute the delayed values for the target variable.

      The rolling means for all numeric features in X and y, if y is numeric, are also returned.

      :param X: Data to transform. None is expected when only the target variable is being used.
      :type X: pd.DataFrame or None
      :param y: Target.
      :type y: pd.Series, or None

      :returns: Transformed X. No original features are returned.
      :rtype: pd.DataFrame


   .. py:method:: update_parameters(self, update_dict, reset_fit=True)

      Updates the parameter dictionary of the component.

      :param update_dict: A dict of parameters to update.
      :type update_dict: dict
      :param reset_fit: If True, will set `_is_fitted` to False.
      :type reset_fit: bool, optional