Algorithms for Efficient Inference in Convolutional Neural Networks

Doctoral Thesis (2021)
Author(s)

Zhu Zhu (TU Delft - Computer Engineering)

Research Group
Computer Engineering
Copyright
© 2021 B. Zhu
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 B. Zhu
Research Group
Computer Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In recent years, the accuracy of Deep Neural Networks (DNNs) has improved significantly because of three main factors: the availability of massive amounts training data, the introduction of powerful low-cost computational resources, and the development of complex deep learning models. The cloud can provide powerful computational resources to calculate DNNs but limits their deployment due to data communication and privacy issues. Thus, computing DNNs at the edge is becoming an important alternative to calculating these models in a centralized service. However, there is a mismatch between the resource-constrained devices at the edge and the models with increased computational complexity. To alleviate this mismatch, both the algorithms and hardware need to be explored to improve the efficiency of training various feedforward and recurrent neural networks and inferring using a DNN.

Files

Doctoral_dissertation.pdf
(pdf | 2.65 Mb)
License info not available