IMAGINE

None, None; None, None; None, None; None, None

IMAGINE

An 8-to-1b 22nm FD-SOI Compute-In-Memory CNN Accelerator With an End-to-End Analog Charge-Based 0.15-8POPS/W Macro Featuring Distribution-Aware Data Reshaping

Journal Article (2025)

Author(s)

Adrian Kneip (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Martin Lefebvre (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Pol Maistriaux (Université Catholique de Louvain)

David Bol (Université Catholique de Louvain)

Research Group

Electronic Instrumentation

SRAM Convolutional neural networks (CNNs) 22nm FD-SOI Analog batch-normalization (ABN) Charge-based dot-product (DP) Computing in-memory (CIM) Hardware-aware training

DOI related publication

https://doi.org/10.1109/TCASAI.2025.3576323 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:5166ea9e-4ac2-4cb2-9ba8-67727b5dbe37

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Electronic Instrumentation

Journal title

IEEE Transactions on Circuits and Systems for Artificial Intelligence

Issue number

3

Volume number

2

Pages (from-to)

222-235

Downloads counter

17

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Charge-domain compute-in-memory (CIM) SRAMs have recently become an enticing compromise between computing efficiency and accuracy to process sub-8b convolutional neural networks (CNNs) at the edge. Yet, they commonly make use of a fixed dot-product (DP) voltage swing, which leads to a loss in effective ADC bits due to data-dependent clipping or truncation effects that waste precious conversion energy and computing accuracy. To overcome this, we present IMAGINE, a workload-adaptive 1-to-8b CIM-CNN accelerator in 22nm FD-SOI. It introduces a 1152×256 end-to-end charge-based macro with a multi-bit DP based on an input-serial, weight-parallel accumulation that avoids power-hungry DACs. An adaptive swing is achieved by combining a channel-wise DP array split with a linear in-ADC implementation of analog batch-normalization (ABN), obtaining a distribution-aware data reshaping. Critical design constraints are relaxed by including the post-silicon equivalent noise within a CIM-aware CNN training framework. Measurement results showcase an 8b system-level energy efficiency of 40TOPS/W at 0.3/0.6V, with competitive accuracies on MNIST and CIFAR-10. Moreover, the peak energy and area efficiencies of the 187kB/mm² macro respectively reach up to 0.15-8POPS/W and 2.6-154TOPS/mm², scaling with the 8-to-1b computing precision. These results exceed previous charge-based designs by 3-to-5× while being the first work to provide linear in-memory rescaling.

Files

IMAGINE_An_8-to-1b_22nm_FD-SOI... (pdf)

(pdf | 4.96 Mb)

- Embargo expired in 07-05-2026

Taverne