Algorithms for Efficient Inference in Convolutional Neural Networks

Doctoral thesis (2021)

Authors

B. Zhu Computer Engineering -

Research Group

Computer Engineering () (TU Delft)

DOI: https://doi.org/10.4233/uuid:0943e030-7486-4ee6-8e7e-b35d02d528b0

Efficiency Attention Reconstruction Convolution neural network Approximation Architecture design Feature reuse Search

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:0943e030-7486-4ee6-8e7e-b35d02d528b0

Published Date

2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Quantum & Computer Engineering

Research Group

Computer Engineering

Abstract

In recent years, the accuracy of Deep Neural Networks (DNNs) has improved significantly because of three main factors: the availability of massive amounts training data, the introduction of powerful low-cost computational resources, and the development of complex deep learning models. The cloud can provide powerful computational resources to calculate DNNs but limits their deployment due to data communication and privacy issues. Thus, computing DNNs at the edge is becoming an important alternative to calculating these models in a centralized service. However, there is a mismatch between the resource-constrained devices at the edge and the models with increased computational complexity. To alleviate this mismatch, both the algorithms and hardware need to be explored to improve the efficiency of training various feedforward and recurrent neural networks and inferring using a DNN.

Files

Doctoral_dissertation.pdf

(.pdf | 2.65 Mb)