IMAGINE

An 8-to-1b 22nm FD-SOI Compute-In-Memory CNN Accelerator With an End-to-End Analog Charge-Based 0.15-8POPS/W Macro Featuring Distribution-Aware Data Reshaping

Journal Article (2025)
Author(s)

Adrian Kneip (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Martin Lefebvre (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Pol Maistriaux (Université Catholique de Louvain)

David Bol (Université Catholique de Louvain)

Research Group
Electronic Instrumentation
DOI related publication
https://doi.org/10.1109/TCASAI.2025.3576323 Final published version
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Electronic Instrumentation
Journal title
IEEE Transactions on Circuits and Systems for Artificial Intelligence
Issue number
3
Volume number
2
Pages (from-to)
222-235
Downloads counter
4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Charge-domain compute-in-memory (CIM) SRAMs have recently become an enticing compromise between computing efficiency and accuracy to process sub-8b convolutional neural networks (CNNs) at the edge. Yet, they commonly make use of a fixed dot-product (DP) voltage swing, which leads to a loss in effective ADC bits due to data-dependent clipping or truncation effects that waste precious conversion energy and computing accuracy. To overcome this, we present IMAGINE, a workload-adaptive 1-to-8b CIM-CNN accelerator in 22nm FD-SOI. It introduces a 1152×256 end-to-end charge-based macro with a multi-bit DP based on an input-serial, weight-parallel accumulation that avoids power-hungry DACs. An adaptive swing is achieved by combining a channel-wise DP array split with a linear in-ADC implementation of analog batch-normalization (ABN), obtaining a distribution-aware data reshaping. Critical design constraints are relaxed by including the post-silicon equivalent noise within a CIM-aware CNN training framework. Measurement results showcase an 8b system-level energy efficiency of 40TOPS/W at 0.3/0.6V, with competitive accuracies on MNIST and CIFAR-10. Moreover, the peak energy and area efficiencies of the 187kB/mm2 macro respectively reach up to 0.15-8POPS/W and 2.6-154TOPS/mm2, scaling with the 8-to-1b computing precision. These results exceed previous charge-based designs by 3-to-5× while being the first work to provide linear in-memory rescaling.

Files

IMAGINE_An_8-to-1b_22nm_FD-SOI... (pdf)
(pdf | 4.96 Mb)
- Embargo expired in 07-05-2026
Taverne