Accelerating AI Inference in MRI Reconstruction Using Model Compression

Master Thesis (2025)
Author(s)

A. Anand (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R.F. Remis – Mentor (TU Delft - Tera-Hertz Sensing)

Emiel Hartsema – Mentor (Philips Medical Systems B.V.)

Francesco Fioranelli – Graduation committee member (TU Delft - Microwave Sensing, Signals & Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2025
Language
English
Graduation Date
14-07-2025
Awarding Institution
Delft University of Technology
Programme
Electrical Engineering | Microelectronics
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Magnetic Resonance Imaging (MRI) is a powerful tool for visualizing internal body structures and is widely used in clinical practice. However, MRI's long scanning times and the high computational demands of post-processing pose challenges, especially in resource-limited environments. Recent advances in machine learning, specifically model compression techniques, offer ways to accelerate MRI post-processing and make it more accessible.

This thesis systematically investigates the application of several model compression methods, such as low-rank factorization, knowledge distillation, and quantization, to enhance the efficiency of a baseline MR reconstruction neural network. By exploring multiple variations within each compression technique, this study evaluates their impact on key performance metrics such as inference speed, model size reduction, and reconstruction accuracy. Extensive tests show significant trade-offs between image fidelity and computational efficiency, providing insights into the practical feasibility of deploying compressed models in clinical workflows.
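Of the three techniques named above, quantization is the simplest to illustrate in isolation. The following is a minimal sketch of symmetric per-tensor int8 weight quantization, written in plain NumPy. The scheme and the function names `quantize_int8` and `dequantize` are assumptions for illustration only; this record does not state which quantization variants the thesis evaluates.

```python
import numpy as np

def quantize_int8(weights):
    """Symmetric per-tensor quantization: float32 weights -> int8 values plus one scale.

    Illustrative scheme only, not necessarily the one used in the thesis.
    """
    max_abs = float(np.max(np.abs(weights)))
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximation of the original float32 weights."""
    return q.astype(np.float32) * scale
```

Storing int8 instead of float32 cuts weight storage by a factor of four, at the cost of a per-weight rounding error of at most half the scale. That error is one source of the fidelity versus efficiency trade-off the abstract describes.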

Among the techniques tested, low-rank factorization implemented via Tucker decomposition emerged as the most effective approach. This method achieved a threefold reduction in inference time while maintaining high reconstruction quality, highlighting its potential to significantly improve MRI processing times in real-world applications.
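To make the idea concrete, here is a minimal sketch of a Tucker-2 factorization of a convolution kernel, computed with a truncated higher-order SVD in NumPy. It assumes a kernel laid out as (output channels, input channels, height, width) and compresses only the two channel modes, which is a common way to apply Tucker decomposition to convolutional layers. The function names, the ranks, and the choice of HOSVD are illustrative assumptions; this record does not describe the thesis's actual implementation.

```python
import numpy as np

def unfold(tensor, mode):
    """Matricize `tensor` so that axis `mode` becomes the row axis."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def tucker2(weight, rank_out, rank_in):
    """Truncated HOSVD on the two channel modes of a conv kernel (assumed layout: O, I, H, W)."""
    # Leading left singular vectors of each channel-mode unfolding.
    u_out = np.linalg.svd(unfold(weight, 0), full_matrices=False)[0][:, :rank_out]
    u_in = np.linalg.svd(unfold(weight, 1), full_matrices=False)[0][:, :rank_in]
    # Project the kernel onto both subspaces to obtain the small core tensor.
    core = np.einsum("oihw,or,is->rshw", weight, u_out, u_in)
    return core, u_out, u_in

def reconstruct(core, u_out, u_in):
    """Expand the factors back into a full-size kernel approximation."""
    return np.einsum("rshw,or,is->oihw", core, u_out, u_in)

def param_counts(weight, rank_out, rank_in):
    """Parameters before and after replacing one conv with 1x1, kxk, 1x1 convs."""
    c_out, c_in, kh, kw = weight.shape
    original = c_out * c_in * kh * kw
    factored = c_in * rank_in + rank_out * rank_in * kh * kw + c_out * rank_out
    return original, factored
```

For a 64 x 32 x 3 x 3 kernel with hypothetical ranks (16, 8), `param_counts` gives 18432 parameters before factorization and 2432 after. The resulting speed-up and reconstruction quality depend on the chosen ranks, the trained weights, and the target hardware, so these counts say nothing about the threefold figure reported above.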

Files

License info not available

File under embargo until 14-07-2027