Visual Interpretation of Recurrent Neural Network on Multi-dimensional Time-series Forecast

Conference Paper (2020)
Author(s)

Qiaomu Shen (The Hong Kong University of Science and Technology)

Yanhong Wu (Visa Research)

Yuzhe Jiang (The Hong Kong University of Science and Technology)

Wei Zeng (Shenzhen Institute of Advanced Technologies)

Alexis K.H. Lau (The Hong Kong University of Science and Technology)

A. Vilanova Bartroli (TU Delft - Computer Graphics and Visualisation)

Huamin Qu (The Hong Kong University of Science and Technology)

Research Group
Computer Graphics and Visualisation
Copyright
© 2020 Qiaomu Shen, Yanhong Wu, Yuzhe Jiang, Wei Zeng, Alexis K.H. Lau, A. Vilanova Bartroli, Huamin Qu
DOI related publication
https://doi.org/10.1109/PacificVis48177.2020.2785
Publication Year
2020
Language
English
Bibliographical Note
Accepted author manuscript
Volume number
2020-June
Pages (from-to)
61-70
ISBN (electronic)
9781728156972
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Recent attempts at utilizing visual analytics to interpret Recurrent Neural Networks (RNNs) mainly focus on natural language processing (NLP) tasks that take symbolic sequences as input. However, many real-world problems, such as environmental pollution forecasting, apply RNNs to sequences of multi-dimensional data in which each dimension represents an individual feature with semantic meaning, such as PM2.5 or SO2. Interpreting RNNs on multi-dimensional sequences is challenging because users need to analyze which features are important at different time steps in order to understand model behavior and gain trust in the predictions. This requires effective and scalable visualization methods that reveal the complex many-to-many relations between hidden units and features. In this work, we propose a visual analytics system to interpret RNNs on multi-dimensional time-series forecasts. Specifically, to provide an overview that reveals the model mechanism, we propose a technique to estimate the hidden unit response by measuring how different feature selections affect the hidden unit output distribution. We then cluster the hidden units and features based on the response embedding vectors. Finally, we propose a visual analytics system that allows users to visually explore the model behavior at the global and individual levels. We demonstrate the effectiveness of our approach with case studies using air pollutant forecast applications.
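
The abstract names the response-estimation and clustering steps but does not give their formulas. As a minimal sketch only, the Python below assumes that a hidden unit's response to a feature can be approximated by the standardized shift in the unit's mean output when only samples with high values of that feature are kept. This is an illustrative stand-in, not necessarily the estimator used in the paper. The function names `response_matrix` and `kmeans`, the 0.8 quantile threshold, and the synthetic data are all hypothetical.

```python
import numpy as np


def response_matrix(hidden, features, quantile=0.8):
    """Assumed response measure: standardized shift of each hidden unit's
    mean output when only samples with a high value of one feature are kept.

    hidden:   (n_samples, n_hidden) hidden-unit outputs
    features: (n_samples, n_features) input feature values
    returns:  (n_hidden, n_features) response matrix
    """
    mean_all = hidden.mean(axis=0)
    std_all = hidden.std(axis=0) + 1e-8  # guard against division by zero
    response = np.zeros((hidden.shape[1], features.shape[1]))
    for j in range(features.shape[1]):
        threshold = np.quantile(features[:, j], quantile)
        selected = features[:, j] >= threshold  # one "feature selection"
        response[:, j] = (hidden[selected].mean(axis=0) - mean_all) / std_all
    return response


def kmeans(points, k, n_iter=50):
    """Plain Lloyd's k-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    for _ in range(1, k):
        dists = np.min([np.linalg.norm(points - c, axis=1) for c in centers], axis=0)
        centers.append(points[np.argmax(dists)])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            if np.any(labels == c):
                centers[c] = points[labels == c].mean(axis=0)
    return labels


# Synthetic stand-in data (not from the paper): two features and six toy
# "hidden units"; units 0-2 follow feature 0, units 3-5 follow feature 1.
rng = np.random.default_rng(0)
features = rng.normal(size=(2000, 2))
hidden = np.tanh(np.concatenate([
    features[:, [0]] + 0.1 * rng.normal(size=(2000, 3)),
    features[:, [1]] + 0.1 * rng.normal(size=(2000, 3)),
], axis=1))

R = response_matrix(hidden, features)   # rows: hidden units, columns: features
unit_labels = kmeans(R, k=2)            # cluster hidden units by response vector
feature_labels = kmeans(R.T, k=2)       # cluster features by response vector
```

Each row of `R` acts as a response embedding for one hidden unit and each column as one for a feature, so the same matrix supports both clusterings. In a real setting, `hidden` would be collected from a trained RNN at a chosen time step rather than generated synthetically.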

Files

License info not available