Binary Neural Networks for Object Detection


Abstract

In the past few years, convolutional neural networks (CNNs) have been widely adopted and have achieved state-of-the-art performance on computer vision tasks. However, CNN-based approaches typically demand large amounts of storage, run-time memory, and compute at both training and inference time, so they are usually run on GPU-based machines to keep inference fast, which makes them poorly suited to low-power applications. Although many approaches have been proposed to compress and accelerate CNN models, most have only been evaluated on relatively simple problems (e.g., image classification), which limits their usefulness for real-world applications. In particular, binary quantization can achieve very high model compression, yet only a few works have applied it to more complex tasks. It is therefore worthwhile to explore and evaluate binary quantization on more complex tasks such as object detection, which underpins many more applications like autonomous driving and face detection. In this project, we apply and evaluate two binary quantization approaches, ABC-Net and PA-Net, on object detection tasks, and we specify the exact implementation details of the binary convolutional operations used. As a result, we achieve up to 6.1× compression (around 16% of the full-precision model size) with as little as a 2.5% accuracy reduction for weight quantization. The weight-quantized models outperform some existing real-time detectors in both accuracy and storage size. Although a large accuracy reduction was observed for input quantization, the quantized model still maintains acceptable accuracy compared with existing real-time object detectors.
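For concreteness, the sketch below illustrates the weight-approximation idea behind ABC-Net: a full-precision weight tensor W is approximated by a linear combination of M binary bases, W ≈ Σᵢ αᵢ Bᵢ. This is a minimal NumPy sketch under simplifying assumptions, not the paper's exact procedure; the fixed, evenly spaced shift parameters and the helper name `abc_binarize` are illustrative choices of ours (ABC-Net can also learn the shifts during training).

```python
import numpy as np

def abc_binarize(W, M=3):
    """Approximate W with a linear combination of M binary bases,
    in the spirit of ABC-Net: W ~ sum_i alpha_i * B_i.

    Illustrative sketch: shift parameters u_i are fixed and evenly
    spaced here, and the alphas are fit by least squares.
    """
    Wc = W - W.mean()          # mean-centered weights
    std = W.std()
    # Evenly spaced shifts in [-1, 1] (a common fixed heuristic).
    us = [-1 + i * 2.0 / (M - 1) for i in range(M)] if M > 1 else [0.0]
    # Each basis is the sign of the shifted weights.
    # (np.sign maps exact zeros to 0; a strict implementation
    # would map them to +1 or -1.)
    B = np.stack([np.sign(Wc + u * std) for u in us])   # shape (M, *W.shape)
    # Solve min_alpha || W - sum_i alpha_i B_i ||^2 by least squares.
    A = B.reshape(M, -1).T                               # (num_weights, M)
    alphas, *_ = np.linalg.lstsq(A, W.ravel(), rcond=None)
    W_approx = np.tensordot(alphas, B, axes=1)           # reconstruction
    return alphas, B, W_approx

# Usage: approximate one convolutional weight tensor with 3 binary bases.
W = np.random.randn(64, 3, 3, 3).astype(np.float32)
alphas, B, W_hat = abc_binarize(W, M=3)
print(alphas, np.abs(W - W_hat).mean())
```

At inference time, each binary basis Bᵢ can be convolved with (binarized) inputs using cheap XNOR and popcount operations, and the M partial results are combined with the scalars αᵢ, which is where the storage and compute savings come from.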