HAS-RL: A Hierarchical Approximate Scheme Optimized With Reinforcement Learning for NoC-Based NN Accelerators

None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None

HAS-RL: A Hierarchical Approximate Scheme Optimized With Reinforcement Learning for NoC-Based NN Accelerators

Journal Article (2024)

Author(s)

Siyue Li (Nanjing University)

Shize Zhou (Nanjing University)

Yongqi Xue (Nanjing University)

Wenjie Fan (Nanjing University)

Tong Cheng (Nanjing University)

Jinlun Ji (Nanjing University)

Chenyang Dai (Nanjing University)

Wenqing Song (Nanjing University)

Chang Gao (TU Delft - Electrical Engineering, Mathematics and Computer Science)

undefined More Authors (External organisation)

Research Group

Electronics

Neural network Offline reinforcement learning Approximate communication Network-on-chip

DOI related publication

https://doi.org/10.1109/TCSI.2024.3359912 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:9b96d465-8916-4499-8147-6b0f9faee334

More Info

expand_more

Publication Year

2024

Language

English

Research Group

Electronics

Issue number

4

Volume number

71

Pages (from-to)

1863-1875

Downloads counter

306

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Network-on-Chip (NoC) is a scalable on-chip communication architecture for the NN accelerator, but with the increase in the number of nodes, the communication delay becomes higher. Applications such as machine learning have a certain resilience to noisy/erroneous transmitted data. Therefore, approximate communication becomes a promising solution to improving performance by reducing traffic loads under the constraint of the acceptable maximum accuracy loss of neural networks. It is a key issue to balance the result quality and the communication delay for approximate NoC systems. The traditional approximate NoC only considers the node-to-node approximation-based dynamic traffic regulation. However, the dynamically changing traffic patterns across different nodes, different times, and different applications lead to a huge search space, which makes it hard to explore an optimal global approximation solution. In this paper, we propose a quality model for different neural networks, which presents the relationship between the quality loss and the data approximate rate. Then, a hierarchical approximate scheme optimized with reinforcement learning (HAS-RL) is proposed and we reduce the complexity of the HAS-RL by reducing the state space and action space, which will reduce the resource overhead as well. After that, we embed a global approximate controller in the NoC system, in which we deploy a policy network trained with the offline reinforcement learning algorithm to adjust the data approximate rates of each node at run time. Compared with the state-of-the-art method, the proposed scheme reduces the average network delay by $13.5\%$ while their accuracies are similar. The proposed HAS-RL only causes an additional area overhead of $1.24\%$ and power consumption of $0.77\%$ compared with the traditional router design.

Files

HAS-RL_A_Hierarchical_Approxim... (pdf)

(pdf | 2.19 Mb)

- Embargo expired in 19-08-2024

License info not available