M.Z. Zahedi | TU Delft Repository

BCIM

Efficient Implementation of Binary Neural Network Based on Computation in Memory

Journal article (2024) - Mahdi Zahedi , Taha Shahroodi , Carlos Escuin , Georgi Gaydadjiev , Stephan Wong , Said Hamdioui

Applications of Binary Neural Networks (BNNs) are promising for embedded systems with hard constraints on energy and computing power. Contrary to conventional neural networks using floating-point datatypes, BNNs use binarized weights and activations to reduce memory and computati ...

MNEMOSENE++: Scalable Multi-Tile Design with Enhanced Buffering and VGSOT-MRAM based Compute-in-Memory Crossbar Array

Conference paper (2023) - Carlos Escuin , Fernando García-Redondo , Francky Catthoor , Mahdi Zahedi , Pablo Ibáñez , Teresa Monreal , Victor Viñals , José María Llabería , James Myers , Julien Ryckaert , Dwaipayan Biswas

This paper optimizes the MNEMOSENE architecture, a compute-in-memory (CiM) tile design integrating computation and storage for increased efficiency. We identify and address bottlenecks in the Row Data (RD) buffer that cause losses in performance. Our proposed approach includes mi ...

Efficient Signed Arithmetic Multiplication on Memristor-based Crossbar

Journal article (2023) - Mahdi Zahedi , Taha Shahroodi , Stephan Wong , Said Hamdioui

The vast potential of memristor-based computation-in-memory (CIM) engines has mainly triggered the mapping of best-suited applications. Nevertheless, with additional support, existing applications can also benefit from CIM. In particular, this paper proposes an energy and area-ef ...

Lightspeed Binary Neural Networks using Optical Phase-Change Materials

Conference paper (2023) - Taha Shahroodi , Rafaela Cardoso , Mahdi Zahedi , Stephan Wong , Alberto Bosio , Ian O'Connor , Said Hamdioui

This paper investigates the potential of a compute-in-memory core based on optical Phase Change Materials (oPCMs) to speed up and reduce the energy consumption of the Matrix-Matrix-Multiplication operation. The paper also proposes a new data mapping for Binary Neural Networks (BN ...

SieveMem: A Computation-in-Memory Architecture for Fast and Accurate Pre-Alignment

Conference paper (2023) - Taha Shahroodi , Michael Miao , Mahdi Zahedi , Stephan Wong , Said Hamdioui

The high execution time of DNA sequence alignment negatively affects many genomic studies that rely on sequence alignment results. Pre-alignment filtering was introduced as a step before alignment to reduce the execution time of short-read sequence alignment greatly. With its suc ...

SparseMEM

Energy-efficient Design for In-memory Sparse-based Graph Processing

Conference paper (2023) - Mahdi Zahedi , Geert Custers , Taha Shahroodi , Georgi Gaydadjiev , Stephan Wong , Said Hamdioui

Performing analysis on large graph datasets in an energy-efficient manner has posed a significant challenge; not only due to excessive data movements and poor locality, but also due to the non-optimal use of high sparsity of such datasets. The latter leads to a waste of resources ...

Computation-in-memory from application-specific to programmable designs based on memristor devices

Doctoral thesis (2023) - M.Z. Zahedi , S. Hamdioui , J.S.S.M. Wong

Computation-in-Memory (CIM) is a promising alternative to traditional computing systems where the storage is conceptually separated fromthe computing units. Instead, the CIM paradigm aims to perform the computation where the data resides, alleviating the memory bottleneck and ult ...

Exploiting PUF Variation to Detect Fault Injection Attacks

Conference paper (2022) - Troya Köylü , Luiza Garaffa , Cezar Reinbrecht , Mahdi Zahedi , Said Hamdioui , Mottaqiallah Taouil

The massive deployment of Internet of Things (IoT) devices makes them vulnerable against physical tampering attacks, such as fault injection. These kind of hardware attacks are very popular as they typically do not require complex equipment or high expertise. Hence, it is importa ...

Demeter

A Fast and Energy-Efficient Food Profiler Using Hyperdimensional Computing in Memory

Journal article (2022) - Taha Shahroodi , Mahdi Zahedi , Can Firtina , Mohammed Alser , Stephan Wong , Onur Mutlu , Said Hamdioui

Food profiling is an essential step in any food monitoring system needed to prevent health risks and potential frauds in the food industry. Significant improvements in sequencing technologies are pushing food profiling to become the main computational bottleneck. State-of-the-art ...

System Design for Computation-in-Memory

From Primitive to Complex Functions

Conference paper (2022) - Mahdi Zahedi , Taha Shahroodi , Geert Custers , Abhairaj Singh , Stephan Wong , Said Hamdioui

In recent years, we are witnessing a trend moving away from conventional computer architectures towards Computation-In-Memory (CIM) based on emerging memristor devices. This is due to the fact that the performance and energy efficiency of traditional computer architectures can no ...

CIM-based Robust Logic Accelerator using 28 nm STT-MRAM Characterization Chip Tape-out

Conference paper (2022) - Abhairaj Singh , Mahdi Zahedi , Taha Shahroodi , Mohit Gupta , Anteneh Gebregiorgis , Manu Komalan , Rajiv V. Joshi , Francky Catthoor , Rajendra Bishnoi , Said Hamdioui

Spin-transfer torque magnetic random access memory (STT-MRAM) based computation-in-memory (CIM) architectures have shown great prospects for an energy-efficient computing. However, device variations and non-idealities narrow down the sensing margin that severely impacts the compu ...

MNEMOSENE

Tile Architecture and Simulator for Memristor-based Computation-in-memory

Journal article (2022) - Mahdi Zahedi , Muah Abu Lebdeh , Christopher Bengel , Dirk Wouters , Stephan Menzel , Manuel Le Gallo , Abu Sebastian , Stephan Wong , Said Hamdioui

In recent years, we are witnessing a trend toward in-memory computing for future generations of computers that differs from traditional von-Neumann architecture in which there is a clear distinction between computing and memory units. Considering that data movements between the c ...

In recent years, we are witnessing a trend toward in-memory computing for future generations of computers that differs from traditional von-Neumann architecture in which there is a clear distinction between computing and memory units. Considering that data movements between the central processing unit (CPU) and memory consume several orders of magnitude more energy compared to simple arithmetic operations in the CPU, in-memory computing will lead to huge energy savings as data no longer needs to be moved around between these units. In an initial step toward this goal, new non-volatile memory technologies, e.g., resistive RAM (ReRAM) and phase-change memory (PCM), are being explored. This has led to a large body of research that mainly focuses on the design of the memory array and its peripheral circuitry. In this article, we mainly focus on the tile architecture (comprising a memory array and peripheral circuitry) in which storage and compute operations are performed in the (analog) memory array and the results are produced in the (digital) periphery. Such an architecture is termed compute-in-memory-periphery (CIM-P). More precisely, we derive an abstract CIM-tile architecture and define its main building blocks. To bridge the gap between higher-level programming languages and the underlying (analog) circuit designs, an instruction-set architecture is defined that is intended to control and, in turn, sequence the operations within this CIM tile to perform higher-level more complex operations. Moreover, we define a procedure to pipeline the CIM-tile operations to further improve the performance. To simulate the tile and perform design space exploration considering different technologies and parameters, we introduce the fully parameterized first-of-its-kind CIM tile simulator and compiler. Furthermore, the compiler is technology-aware when scheduling the CIM-tile instructions. Finally, using the simulator, we perform several preliminary design space explorations regarding the three competing technologies, ReRAM, PCM, and STT-MRAM concerning CIM-tile parameters, e.g., the number of ADCs. Additionally, we investigate the effect of pipelining in relation to the clock speeds of the digital periphery assuming the three technologies. In the end, we demonstrate that our simulator is also capable of reporting energy consumption for each building block within the CIM tile after the execution of in-memory kernels considering the data-dependency on the energy consumption of the memory array. All the source codes are publicly available.

KrakenOnMem

A Memristor-Augmented HW/SW Framework for Taxonomic Profiling

Conference paper (2022) - Taha Shahroodi , Mahdi Zahedi , Abhairaj Singh , Stephan Wong , Said Hamdioui

State-of-the-art taxonomic profilers that comprise the first step in larger-context metagenomic studies have proven to be computationally intensive, i.e., while accurate, they come at the cost of high latency and energy consumption. Table Lookup operation is a primary bottleneck ...

Tile Architecture and Hardware Implementation for Computation-in-Memory

Conference paper (2021) - Mahdi Zahedi , Remon van Duijnen , Stephan Wong , Said Hamdioui

Computation-in-memory (CIM) shows great promise for specific applications by employing emerging (non-volatile) memory technologies such as memristors for both storage and compute, greatly reducing energy consumption, and improving performance. Based on our own observations, we ca ...

Efficient organization of digital periphery to support integer datatype for memristor-based cim

Conference paper (2020) - Mahdi Zahedi , Mahta Mayahinia , Muath Abu Lebdeh , Stephan Wong , Said Hamdioui

Von Neumann-based architectures suffer from costly communication between CPU and memory. This communication imposes several orders of magnitude more power and performance overheads compared to the arithmetic operations performed by the processor. This overhead becomes critical fo ...