Lightweight and Accurate DNN-Based Anomaly Detection at Edge

Abstract

Deep neural networks (DNNs) have shown significant success in various anomaly detection applications such as smart surveillance and industrial quality control. It is increasingly important to detect anomalies directly on edge devices because of high responsiveness requirements and tight latency constraints. The accuracy of DNN-based solutions relies on large model capacity, and hence long training and inference times, making them impractical on resource-constrained edge devices. It is therefore imperative to scale DNN model sizes according to run-time system requirements, i.e., meeting deadlines with minimal accuracy loss, which depend heavily on the platform and real-time system status. Existing scaling techniques either require long training time to pre-generate scaling options or disturb the unstable training process of anomaly detection DNNs; they thus lack adaptability to heterogeneous edge systems and incur low inference accuracy. In this article, we present LightDNN, which scales DNN models for anomaly detection applications at the edge, featuring high detection accuracy with lightweight training and inference time. To this end, LightDNN quickly extracts and compresses blocks in a DNN, and provides a large scaling space (e.g., 1 million options) by dynamically combining these compressed blocks online. At run-time, LightDNN predicts the DNN's inference latency according to the monitored system status and optimizes the combination of blocks to maximize accuracy under deadline constraints. We implement and extensively evaluate LightDNN on both CPU and GPU edge platforms using 8 popular anomaly detection workloads. Comparative experiments with state-of-the-art methods show that our approach provides 145.8 to 0.56 trillion times more scaling options without increasing training and inference overheads, achieving up to a 15.05% increase in accuracy under the same deadlines.
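To make the run-time step concrete, the minimal sketch below illustrates the general idea of choosing one compressed variant per block position so that predicted end-to-end latency meets a deadline while a predicted-accuracy score is maximized. All names, the exhaustive search, and the accuracy/latency proxies are assumptions for illustration, not LightDNN's actual implementation, which the abstract does not specify.

```python
# Hypothetical sketch of deadline-constrained block-combination selection.
# Each block position in the DNN has several compressed variants; we pick one
# variant per position to maximize a predicted-accuracy score subject to a
# predicted-latency budget. A real system would prune the search or use a
# heuristic instead of enumerating all combinations.

from dataclasses import dataclass
from itertools import product


@dataclass
class BlockVariant:
    name: str
    predicted_latency_ms: float  # latency predicted from monitored system status
    accuracy_proxy: float        # assumed per-variant accuracy contribution


def select_combination(candidates, deadline_ms):
    """Exhaustively search combinations (feasible only for small candidate sets)."""
    best_combo, best_score = None, float("-inf")
    for combo in product(*candidates):
        latency = sum(v.predicted_latency_ms for v in combo)
        if latency > deadline_ms:
            continue  # violates the deadline constraint
        score = sum(v.accuracy_proxy for v in combo)
        if score > best_score:
            best_combo, best_score = combo, score
    return best_combo


if __name__ == "__main__":
    # Two block positions, each with a full-size and a compressed variant.
    candidates = [
        [BlockVariant("block1-full", 12.0, 0.95), BlockVariant("block1-small", 7.0, 0.90)],
        [BlockVariant("block2-full", 15.0, 0.93), BlockVariant("block2-small", 8.0, 0.88)],
    ]
    combo = select_combination(candidates, deadline_ms=20.0)
    print([v.name for v in combo] if combo else "no feasible combination")
```

Under the 20 ms deadline in this toy example, the search would pair one full-size block with one compressed block rather than defaulting to the smallest model, which is the kind of accuracy-latency trade-off the abstract describes.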