Extracting insights from black box neural networks for network-wide traffic predictions

Master Thesis (2023)

Author(s)

H. Wang (TU Delft - Civil Engineering & Geosciences)

Contributor(s)

Hans Van Lint – Graduation committee member (TU Delft - Transport and Planning)

Panchamy Krishnakumari – Mentor (TU Delft - Transport and Planning)

Emir Demirovic – Graduation committee member (TU Delft - Algorithmics)

G. Li – Coach (TU Delft - Transport and Planning)

Faculty

Civil Engineering & Geosciences

Keywords

Explainable AI, Image Classification, Deep Learning, Neural Network, Traffic forecasting, Transfer Learning

To reference this document use:

https://resolver.tudelft.nl/uuid:6fc2ff65-4cf9-47f1-8859-9792f278670c

Publication Year

2023

Language

English

Graduation Date

24-02-2023

Awarding Institution

Delft University of Technology

Programme

Civil Engineering | Transport and Planning

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Accurate and trustworthy short-term traffic prediction is crucial in the modern world for the comfort of drivers and decision-makers as it is used to improve the performance of traffic management systems, lessen congestion, increase safety, and shorten journey times. It is possible to discover useful information for network transportation planning, such as forecasting demand, finding bottlenecks, and prioritizing infrastructure improvements, by concentrating on network-wide traffic prediction.

Scholars have developed a variety of methods that can be generally divided into model-based and data-based methods in order to accurately predict network-wide traffic. However, while studies have demonstrated the capability of deep learning methods, particularly convolutional neural networks (CNNs), in predicting traffic states, the complex nonlinear spatial and temporal traffic characteristics, the time-consuming model creation and training, and the unexplained methodology and predictions continue to pose challenges to the task.

This thesis seeks to address these issues by analyzing how deep neural networks identify spatiotemporal traffic patterns for network-wide traffic predictions. To this end, a hybrid CNN-RNN model utilizing a pretrained Inception ResNet v2 feature extractor and a long short-term memory encoder-decoder is constructed to forecast network traffic speeds. A pretrained Inception ResNet v2-based image classifier is built based on the predictions to identify traffic patterns, and Grad-CAM is used to explore how the model identifies them. A freeway network in Amsterdam, Netherlands, is used as a case study.
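As a rough illustration of the Grad-CAM step mentioned above, the sketch below shows only the core combination rule: each feature map of a convolutional layer is weighted by the spatial average of the class-score gradient for that channel, the weighted maps are summed, and a ReLU keeps the positively contributing locations. This is not the thesis implementation. The function name `grad_cam` and the toy arrays are hypothetical, and in practice the activations and gradients would come from the trained Inception ResNet v2-based classifier rather than from hand-written lists.

```python
# Minimal Grad-CAM combination step in pure Python (illustrative only).
# feature_maps[k][i][j]: activation of channel k at spatial position (i, j).
# gradients[k][i][j]: derivative of the class score w.r.t. that activation.
# Both inputs are hypothetical toy values, not outputs of the thesis model.

def grad_cam(feature_maps, gradients):
    """Return the ReLU of the gradient-weighted sum of the feature maps."""
    height = len(feature_maps[0])
    width = len(feature_maps[0][0])

    # Channel weights: global average of the gradients over all positions.
    weights = [
        sum(sum(row) for row in grad) / (height * width)
        for grad in gradients
    ]

    # Weighted sum of the feature maps, accumulated position by position.
    heatmap = [[0.0] * width for _ in range(height)]
    for weight, fmap in zip(weights, feature_maps):
        for i in range(height):
            for j in range(width):
                heatmap[i][j] += weight * fmap[i][j]

    # ReLU keeps only positions that increase the class score.
    return [[max(0.0, value) for value in row] for row in heatmap]


toy_maps = [
    [[1.0, 0.0], [0.0, 2.0]],
    [[0.0, 3.0], [1.0, 0.0]],
]
toy_grads = [
    [[0.5, 0.5], [0.5, 0.5]],      # channel weight 0.5
    [[-1.0, -1.0], [-1.0, -1.0]],  # channel weight -1.0
]
heatmap = grad_cam(toy_maps, toy_grads)
```

In this toy case the second channel has a negative weight, so the positions where it is most active are suppressed to zero in the resulting heatmap, while the positions dominated by the first channel remain highlighted.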

Although the hybrid CNN-RNN model was expected to perform comparably to state-of-the-art methods, e.g. the DGCN proposed by Li et al., the results indicate that it can neither fully capture the dynamic characteristics of the traffic nor provide accurate predictions. The image classifier also failed to identify the distinct traffic patterns, although Grad-CAM succeeded in indicating locations with rapid changes in values.

Overall, the findings highlight the influence of inductive bias on deep learning models, and the importance of fine-tuning and model-data compatibility. Although further research is required, the conclusions can still support informed decisions when choosing appropriate models for future network-wide traffic speed prediction tasks.

Files

Thesis_HeqiWang.pdf

(pdf | 11.7 Mb)

License info not available