Improving the performance of Recurrent Neural Networks for time series prediction by combining Long Short-Term Memory and Attention Long Short-Term Memory

Abstract

Recurrent neural networks (RNNs) used in time series prediction are still not perfect in their predictions, and improvements can still be made in this area. Most recently, transformers have led to great improvements in the field of RNNs; however, transformers cannot be applied directly to time series data, because their architecture does not account for the flow of time and would use future data to predict past events. This research aims to further improve the performance of machine learning models on time series prediction. It attempts to do so by implementing a new neural network model based on the multi-head attention mechanism (used in transformers) and combining it with an existing neural network model, long short-term memory (LSTM). To test whether the newly implemented models improve performance, they are evaluated on a weather dataset and compared on their ability to correctly predict daily maximum temperatures. The final results, however, show that combining LSTM and attention LSTM (ALSTM) models does not yield an improvement in loss that is worth the extra instability added to the model and the extra computational cost needed to train it.
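For illustration only, the sketch below shows one way an LSTM can be combined with causal multi-head attention in PyTorch, using a triangular attention mask so that no position can attend to future time steps. This is a minimal assumed example, not the thesis's actual implementation; the module name LSTMWithAttention, the layer sizes, and the feature count are all placeholders.

```python
import torch
import torch.nn as nn

class LSTMWithAttention(nn.Module):
    """Hypothetical sketch: an LSTM encoder followed by causal multi-head
    self-attention over the LSTM outputs, with a linear head predicting the
    next value (e.g. the next daily maximum temperature)."""

    def __init__(self, n_features: int, hidden: int = 64, heads: int = 4):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.attn = nn.MultiheadAttention(hidden, heads, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):                     # x: (batch, time, n_features)
        h, _ = self.lstm(x)                   # h: (batch, time, hidden)
        t = h.size(1)
        # Causal mask: position i may only attend to positions <= i,
        # so the model never uses future data to predict past events.
        mask = torch.triu(
            torch.ones(t, t, dtype=torch.bool, device=x.device), diagonal=1
        )
        a, _ = self.attn(h, h, h, attn_mask=mask)
        return self.head(a[:, -1])            # prediction for the last time step

# Example usage on dummy data shaped like a sliding window of daily weather features.
model = LSTMWithAttention(n_features=5)
window = torch.randn(8, 30, 5)                # 8 samples, 30 days, 5 features
print(model(window).shape)                    # torch.Size([8, 1])
```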