HERMES-Core-A 1.59-TOPS/mm2PCM on 14-nm CMOS In-Memory Compute Core Using 300-ps/LSB Linearized CCO-Based ADCs

Journal Article (2022)
Author(s)

Riduan Khaddam-Aljameh (Zurich Lab, Axelera AI)

Milos Stanisavljevic (Axelera AI, Zurich Lab)

Jordi Fornt Mas (Zurich Lab, Barcelona Supercomputing Center)

Geethan Karunaratne (Zurich Lab)

Matthias Brandli (Zurich Lab)

Feng Liu (IBM Research)

Abhairaj Singh (TU Delft - Computer Engineering)

Silvia M. Muller (IBM Systems and Technology)

Urs Egger (Zurich Lab)

undefined More Authors (External organisation)

Research Group
Computer Engineering
DOI related publication
https://doi.org/10.1109/JSSC.2022.3140414
More Info
expand_more
Publication Year
2022
Language
English
Research Group
Computer Engineering
Journal title
IEEE Journal of Solid-State Circuits
Issue number
4
Volume number
57
Pages (from-to)
1027-1038
Downloads counter
387
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We present a 256 × 256 in-memory compute (IMC) core designed and fabricated in 14-nm CMOS technology with backend-integrated multi-level phase change memory (PCM). It comprises 256 linearized current-controlled oscillator (CCO)-based A/D converters (ADCs) at a compact 4-μm pitch and a local digital processing unit (LDPU) performing affine scaling and ReLU operations. A frequency-linearization technique for CCO is introduced, which increases the maximum CCO frequency beyond 3 GHz, while ensuring accurate on-chip matrix-vector multiplications (MVMs). Moreover, the design and functionality of the digital ADC calibration procedure is described in detail and the MVM accuracy is quantified. Finally, the measured classification accuracies of deep learning (DL) inference applications on the MNIST and CIFAR-10 datasets, when two IMC cores are employed, are presented. For a performance density of 1.59 TOPS/mm2, a measured energy efficiency of 10.5 TOPS/W, at a main clock frequency of 1 GHz, is achieved.