Towards lossless binary convolutional neural networks using piecewise approximation

Book Chapter (2020)
Author(s)

Baozhou Zhu (TU Delft - Computer Engineering)

Zaid Al-Ars (TU Delft - Computer Engineering)

Wei Pan (TU Delft - Robust Robot Systems)

Research Group
Computer Engineering
DOI related publication
https://doi.org/10.3233/FAIA200286 Final published version
More Info
expand_more
Publication Year
2020
Language
English
Research Group
Computer Engineering
Volume number
325
Pages (from-to)
1730-1737
ISBN (print)
978-1-64368-100-9
ISBN (electronic)
978-1-64368-101-6
Event
24th European Conference on Artificial Intelligence, ECAI 2020, including 10th Conference on Prestigious Applications of Artificial Intelligence, PAIS 2020 (2020-08-29 - 2020-09-08), Santiago de Compostela, Online, Spain
Downloads counter
258
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Binary Convolutional Neural Networks (CNNs) can significantly reduce the number of arithmetic operations and the size of memory storage, which makes the deployment of CNNs on mobile or embedded systems more promising. However, the accuracy degradation of single and multiple binary CNNs is unacceptable for modern architectures and large scale datasets like ImageNet. In this paper, we proposed a Piecewise Approximation (PA) scheme for multiple binary CNNs which lessens accuracy loss by approximating full precision weights and activations efficiently, and maintains parallelism of bitwise operations to guarantee efficiency. Unlike previous approaches, the proposed PA scheme segments piece-wisely the full precision weights and activations, and approximates each piece with a scaling coefficient. Our implementation on ResNet with different depths on ImageNet can reduce both Top-1 and Top-5 classification accuracy gap compared with full precision to approximately 1.0%. Benefited from the binarization of the downsampling layer, our proposed PA-ResNet50 requires less memory usage and two times Flops than single binary CNNs with 4 weights and 5 activations bases. The PA scheme can also generalize to other architectures like DenseNet and MobileNet with similar approximation power as ResNet which is promising for other tasks using binary convolutions. The code and pretrained models will be publicly available.