Short-term forecasting of non-stationary time series using multiple feature selection methods
M.I. Özyar (TU Delft - Electrical Engineering, Mathematics and Computer Science)
F.A. Oliehoek – Mentor (TU Delft - Interactive Intelligence)
Miguel Suau de Castro – Graduation committee member (TU Delft - Interactive Intelligence)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Time series forecasting has been proved to be relatively easier for stationary time series, compared to non-stationary time series. This research proposes a method to partially omit the non-stationarity of the data using prioritized sampling. Using multiple feature selection methods in combination with a random forest regressor (RFR), we aim to predict the values for a non-stationary time series. In particular, the principal component analysis (PCA), kernel PCA, incremental PCA and independent component analysis methods are used. The features extracted from these methods will be fed into an RFR both individually and combined, using the union and intersection operators. The features given by the IPCA ∪ PCA ∪ KPCA method, using prioritized sampling with multiple features per day provide the best improvement over the baseline.