B. Zhu | TU Delft Repository

Algorithms for Efficient Inference in Convolutional Neural Networks

Doctoral thesis (2021) - B. Zhu (author)

In recent years, the accuracy of Deep Neural Networks (DNNs) has improved significantly because of three main factors: the availability of massive amounts of training data, the introduction of powerful low-cost computational resources, and the development of complex deep learning mo ...

An Attention Module for Convolutional Neural Networks

Conference paper (2021) - Baozhou Zhu (author), Peter Hofstee (author), Jinho Lee (author), Z. Al-Ars (author)

The attention mechanism has been regarded as an advanced technique to capture long-range feature interactions and to boost the representation capability of convolutional neural networks. However, we found two overlooked problems in current attentional activations-based models: the appr ...

ReAF

Reducing approximation of channels by reducing feature reuse within convolution

Journal article (2020) - Baozhou Zhu (author), Z. Al-Ars (author), Peter Hofstee (author)

High-level feature maps of Convolutional Neural Networks are computed by reusing their corresponding low-level feature maps, which takes full advantage of feature reuse to improve computational efficiency. This form of feature reuse is referred to as feature reuse between convo ...

Towards lossless binary convolutional neural networks using piecewise approximation

Book chapter (2020) - Baozhou Zhu (author), Zaid Al-Ars (author), Wei Pan (author)

Binary Convolutional Neural Networks (CNNs) can significantly reduce the number of arithmetic operations and the size of memory storage, which makes the deployment of CNNs on mobile or embedded systems more promising. However, the accuracy degradation of single and multiple binar ...

NASB

Neural Architecture Search for Binary Convolutional Neural Networks

Conference paper (2020) - Baozhou Zhu (author), Z. Al-Ars (author), Peter Hofstee (author)

Binary Convolutional Neural Networks (CNNs) have significantly reduced the number of arithmetic operations and the size of memory storage needed for CNNs, which makes their deployment on mobile and embedded systems more feasible. However, after binarization, the CNN architecture ...

Diminished-1 Fermat Number Transform for Integer Convolutional Neural Networks

Conference paper (2019) - Baozhou Zhu (author), N. Ahmed (author), Johan Peltenburg (author), K. Bertels (author), Zaid Al-Ars (author)

Convolutional Neural Networks (CNNs) are a class of widely used deep artificial neural networks. However, training large CNNs to produce state-of-the-art results can take a long time. In addition, we need to reduce the compute time of the inference stage for trained networks to make ...