Towards lossless binary convolutional neural networks using piecewise approximation

None, None; None, None; None, None

Towards lossless binary convolutional neural networks using piecewise approximation

Book Chapter (2020)

Author(s)

Baozhou Zhu (TU Delft - Computer Engineering)

Zaid Al-Ars (TU Delft - Computer Engineering)

Wei Pan (TU Delft - Robust Robot Systems)

Research Group

Computer Engineering

DOI related publication

https://doi.org/10.3233/FAIA200286 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:b760b9cc-c61b-4954-9033-15a426f5b550

More Info

expand_more

Publication Year

2020

Language

English

Research Group

Computer Engineering

Volume number

325

Pages (from-to)

1730-1737

ISBN (print)

978-1-64368-100-9

ISBN (electronic)

978-1-64368-101-6

Event

24th European Conference on Artificial Intelligence, ECAI 2020, including 10th Conference on Prestigious Applications of Artificial Intelligence, PAIS 2020 (2020-08-29 - 2020-09-08), Santiago de Compostela, Online, Spain

Downloads counter

258

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Binary Convolutional Neural Networks (CNNs) can significantly reduce the number of arithmetic operations and the size of memory storage, which makes the deployment of CNNs on mobile or embedded systems more promising. However, the accuracy degradation of single and multiple binary CNNs is unacceptable for modern architectures and large scale datasets like ImageNet. In this paper, we proposed a Piecewise Approximation (PA) scheme for multiple binary CNNs which lessens accuracy loss by approximating full precision weights and activations efficiently, and maintains parallelism of bitwise operations to guarantee efficiency. Unlike previous approaches, the proposed PA scheme segments piece-wisely the full precision weights and activations, and approximates each piece with a scaling coefficient. Our implementation on ResNet with different depths on ImageNet can reduce both Top-1 and Top-5 classification accuracy gap compared with full precision to approximately 1.0%. Benefited from the binarization of the downsampling layer, our proposed PA-ResNet50 requires less memory usage and two times Flops than single binary CNNs with 4 weights and 5 activations bases. The PA scheme can also generalize to other architectures like DenseNet and MobileNet with similar approximation power as ResNet which is promising for other tasks using binary convolutions. The code and pretrained models will be publicly available.

Files

FAIA_325_FAIA200286.pdf

(pdf | 0.457 Mb)