A New Logarithmic Quantization Technique and Corresponding Processing Element Design for CNN Accelerators

Master Thesis (2022)
Author(s)

L. Jiang (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

T.G.R.M. van Leuken – Mentor (TU Delft - Signal Processing Systems)

D. Aledo Ortega – Mentor (TU Delft - Signal Processing Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2022 Longxing Jiang
Publication Year
2022
Language
English
Graduation Date
29-11-2022
Awarding Institution
Delft University of Technology
Programme
Electrical Engineering | Circuits and Systems
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Convolutional Neural Networks (CNNs) have become a popular solution for computer vision problems. However, due to the high data volumes and intensive computation they involve, deploying CNNs on low-power hardware systems is still challenging.
The power consumption of CNNs can be prohibitive on the most common implementation platforms, CPUs and GPUs. Therefore, hardware accelerators that can exploit CNN parallelism, together with methods to reduce the computation burden or memory requirements, are still hot research topics. Quantization is one such method. One quantization strategy well suited to low-power deployments is logarithmic quantization.

Logarithmic quantization for CNNs: a) fits typical weight and activation distributions well, and b) allows the multiplication operation to be replaced by a shift operation, which can be implemented with fewer hardware resources.
In this thesis, a new quantization method named Jumping Log Quantization (JLQ) is proposed. The key idea of JLQ is to extend the quantization range by adding a coefficient parameter $s$ to the power-of-two exponent ($2^{sx+i}$).
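As an illustration, the following minimal Python sketch (not the thesis implementation; the function name, parameter defaults, and code range are illustrative assumptions) maps each weight to the nearest representable value of the form sign(w)·2^(sx+i):

import numpy as np

def jlq_quantize(w, bits=4, s=2, i=-6):
    # One bit is assumed to be reserved for the sign; the remaining codes x
    # index the representable magnitudes 2**(s*x + i).
    w = np.asarray(w, dtype=float)
    codes = np.arange(2 ** (bits - 1))
    exponents = s * codes + i            # s > 1 "jumps" over exponents, extending the range
    levels = 2.0 ** exponents
    mag = np.maximum(np.abs(w), 1e-12)   # guard against log2(0)
    # nearest representable magnitude, measured in the log domain
    idx = np.abs(np.log2(mag)[..., None] - exponents).argmin(axis=-1)
    return np.sign(w) * levels[idx]

With s = 2 and i = -6, for example, the representable magnitudes are 2^-6, 2^-4, 2^-2, ..., so the same number of codes spans a wider range than standard logarithmic quantization (s = 1).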

This quantization strategy skips some of the values used by standard logarithmic quantization. In addition, a small hardware-friendly optimization called weight de-zeroing is proposed in this work. Zero-valued weights, which cannot be applied with a single shift operation, are replaced with nonzero logarithmic weights, reducing hardware resources with little accuracy loss.
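One possible reading of weight de-zeroing, again only a sketch under the same illustrative parameters (the choice of replacement level is an assumption, not taken from the thesis): weights that quantize to exactly zero are mapped to a nonzero JLQ level, so every weight can be applied with a single shift.

def dezero(q_weights, s=2, i=-6):
    # Replace zero-valued (quantized) weights with the smallest nonzero
    # JLQ level (code x = 0), keeping the whole tensor shift-friendly.
    q = np.asarray(q_weights, dtype=float).copy()
    q[q == 0.0] = 2.0 ** i
    return q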

To implement the Multiply-and-Accumulate (MAC) operation (needed to compute convolutions) when the weights are JLQ-ed and de-zeroed, a new Processing Element (PE) has been developed. This new PE uses a modified barrel shifter that efficiently avoids the skipped values.
Resource utilization, area, and power consumption of the standalone PE are reported, as are resource utilization and power consumption in a systolic-array-based accelerator.
The results show that JLQ performs better than other state-of-the-art logarithmic quantization methods when the bit width of the operands becomes very small.
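The following behavioral Python sketch (an assumption about the dataflow, not the PE's RTL) shows why JLQ-ed, de-zeroed weights make the MAC shift-only: each weight is stored as a (sign, x) code, and the only shift distances ever requested are s*x + i, which is all the modified barrel shifter has to cover and why it can omit the skipped values.

def jlq_mac(activations, weight_codes, s=2, i=0, acc=0):
    # activations: integers; weight_codes: (sign, x) pairs with sign in {-1, +1}
    for a, (sign, x) in zip(activations, weight_codes):
        shift = s * x + i                                   # only these distances occur
        p = (a << shift) if shift >= 0 else (a >> -shift)   # shift replaces the multiply
        acc += p if sign > 0 else -p
    return acc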

Files

Thesis.pdf
(pdf | 2.88 MB)
License info not available