Visual Interpretation of Recurrent Neural Network on Multi-dimensional Time-series Forecast

Conference Paper (2020)
Author(s)

Qiaomu Shen (The Hong Kong University of Science and Technology)

Yanhong Wu (Visa Research)

Yuzhe Jiang (The Hong Kong University of Science and Technology)

Wei Zeng (Shenzhen Institute of Advanced Technologies)

Alexis K.H. Lau (The Hong Kong University of Science and Technology)

Anna Vianova (TU Delft - Computer Graphics and Visualisation)

Huamin Qu (The Hong Kong University of Science and Technology)

DOI related publication
https://doi.org/10.1109/PacificVis48177.2020.2785 Final published version
More Info
expand_more
Publication Year
2020
Language
English
Bibliographical Note
Accepted author manuscript
Volume number
2020-June
Article number
9086238
Pages (from-to)
61-70
ISBN (electronic)
9781728156972
Event
Downloads counter
318
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Recent attempts at utilizing visual analytics to interpret Recurrent Neural Networks (RNNs) mainly focus on natural language processing (NLP) tasks that take symbolic sequences as input. However, many real-world problems like environment pollution forecasting apply RNNs on sequences of multi-dimensional data where each dimension represents an individual feature with semantic meaning such as PM2.5 and SO2. RNN interpretation on multi-dimensional sequences is challenging as users need to analyze what features are important at different time steps to better understand model behavior and gain trust in prediction. This requires effective and scalable visualization methods to reveal the complex many-to-many relations between hidden units and features. In this work, we propose a visual analytics system to interpret RNNs on multi-dimensional time-series forecasts. Specifically, to provide an overview to reveal the model mechanism, we propose a technique to estimate the hidden unit response by measuring how different feature selections affect the hidden unit output distribution. We then cluster the hidden units and features based on the response embedding vectors. Finally, we propose a visual analytics system which allows users to visually explore the model behavior from the global and individual levels. We demonstrate the effectiveness of our approach with case studies using air pollutant forecast applications.

Files

License info not available