The rapid growth of deep learning models, particularly Transformers, has far outpaced hardware scaling, increasing pressure on memory and compute efficiency. While INT8 quantization reduces memory requirements, it often sacrifices accuracy. Microscaling (MX) formats, such as MXINT8, address this trade-off by grouping INT8 values with a shared exponent, achieving FP32-level accuracy with up to 4 times memory savings. However, efficient execution of mixed integer–floating-point operations requires specialized hardware. Prior MX accelerators based on systolic arrays are limited by underutilized processing elements or the overhead of FP32 peripheries.
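For intuition, the following is a minimal Python sketch of the shared-exponent idea behind MXINT8. The block size of 32 matches the common MX choice, but the exact element encoding and rounding rules of the OCP Microscaling specification are simplified, and the helper names are illustrative only.

```python
import numpy as np

def mxint8_quantize(block):
    """Quantize one block of FP32 values into a shared power-of-two
    exponent plus INT8 elements (simplified MXINT8-style encoding)."""
    amax = float(np.max(np.abs(block)))
    if amax == 0.0:
        return 0, np.zeros(block.shape, dtype=np.int8)
    # Shared exponent chosen so the largest magnitude lands in the INT8 range.
    shared_exp = int(np.floor(np.log2(amax))) - 6
    ints = np.clip(np.round(block / 2.0 ** shared_exp), -128, 127).astype(np.int8)
    return shared_exp, ints

def mxint8_dequantize(shared_exp, ints):
    """Recover approximate FP32 values from the shared exponent + INT8 block."""
    return ints.astype(np.float32) * np.float32(2.0 ** shared_exp)

# Example: a block of 32 FP32 values is stored as 32 INT8 elements plus one
# shared exponent, i.e. roughly a quarter of the FP32 footprint.
x = np.random.randn(32).astype(np.float32)
exp, q = mxint8_quantize(x)
x_hat = mxint8_dequantize(exp, q)
```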
This work presents MXITA, a multi-dimensional systolic array accelerator for MX matrix multiplications in neural network workloads. The architecture introduces a parameterization over (M, N, P, Q) that enables trade-offs between supported MX block sizes and FP32 peripheral reuse while sustaining high throughput. MXITA was designed, implemented, and integrated into the Snitch cluster, and verified for functional correctness at both the module and system level.
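As context for where the integer and FP32 datapaths sit, the sketch below shows a reference-level MX matrix multiplication: INT8 dot products within each shared-exponent block, with the per-block power-of-two scales applied and accumulated in FP32. The function and argument names are illustrative; the sketch does not describe the MXITA dataflow or the meaning of the (M, N, P, Q) parameters.

```python
import numpy as np

def mx_matmul_reference(A_ints, A_exps, B_ints, B_exps, block=32):
    """Reference MX GEMM: A_ints is (M, K) INT8 with per-block exponents
    A_exps of shape (M, K // block); B_ints is (K, N) INT8 with B_exps of
    shape (K // block, N). Integer MACs inside a block, FP32 outside."""
    M, K = A_ints.shape
    _, N = B_ints.shape
    assert K % block == 0
    C = np.zeros((M, N), dtype=np.float32)
    for b in range(K // block):
        lo, hi = b * block, (b + 1) * block
        # Integer datapath: INT8 x INT8 products accumulated in INT32.
        partial = A_ints[:, lo:hi].astype(np.int32) @ B_ints[lo:hi, :].astype(np.int32)
        # FP32 periphery: combine the two shared exponents and accumulate.
        scale = 2.0 ** (A_exps[:, b][:, None] + B_exps[b, :][None, :])
        C += partial.astype(np.float32) * scale.astype(np.float32)
    return C
```

In hardware terms, the per-block FP32 scaling and accumulation step is the FP32 periphery whose cost MXITA amortizes across the integer compute array.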
Synthesis in GF22 technology demonstrates that MXITA achieves higher area efficiency than prior state-of-the-art MX accelerators by amortizing FP32 hardware across compute tiles and reducing periphery overhead. These results highlight the potential of multi-dimensional systolic arrays as scalable and efficient hardware for MX-quantized deep learning workloads.