Accelerating AI Inference in MRI Reconstruction Using Model Compression

Master Thesis (2025)

Author(s)

A. Anand (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R.F. Remis – Mentor (TU Delft - Tera-Hertz Sensing)

Emiel Hartsema – Mentor (Philips Medical Systems B.V.)

Francesco Fioranelli – Graduation committee member (TU Delft - Microwave Sensing, Signals & Systems)

Faculty

Electrical Engineering, Mathematics and Computer Science

Knowledge Distillation, MRI, Model Compression, Low-rank factorization, Inference time reduction

To reference this document use:

https://resolver.tudelft.nl/uuid:e671c1af-653f-4225-bfbb-559a7f5a7013

More Info

Publication Year

2025

Language

English

Graduation Date

14-07-2025

Awarding Institution

Delft University of Technology

Programme

Electrical Engineering | Microelectronics

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Magnetic Resonance Imaging (MRI) is a powerful tool for visualizing internal body structures and is widely used in clinical fields. However, MRI's long scanning times and high computational demands for post-processing pose challenges, especially in resource-limited environments. Recent advancements in machine learning, specifically model compression techniques, have offered solutions to accelerate MRI post-processing and make it more accessible.

This thesis systematically investigates the application of several model compression methods, such as low-rank factorization, knowledge distillation, and quantization, to enhance the efficiency of a baseline MR reconstruction neural network. By exploring multiple variations within each compression technique, this study evaluates their impact on key performance metrics, including inference speed, model size reduction, and reconstruction accuracy. Extensive tests show significant trade-offs between image fidelity and computational efficiency, providing insights into the practical feasibility of deploying compressed models in clinical workflows.

Among the techniques tested, low-rank factorization implemented via Tucker decomposition emerged as the most effective approach. This method achieved a threefold reduction in inference time while maintaining high reconstruction quality, highlighting its potential to significantly improve MRI processing times in real-world applications.
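
To illustrate why a Tucker-style factorization can shrink a convolutional layer, the sketch below counts parameters before and after a Tucker-2 factorization. The layout (a 1x1 convolution, a smaller k x k core convolution, and a second 1x1 convolution) is a common way to apply Tucker decomposition to convolutions and is an assumption here; the abstract does not state which variant the thesis uses. The layer sizes and ranks are hypothetical, chosen only for the arithmetic.

```python
def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a dense k x k convolution (bias ignored)."""
    return c_in * c_out * k * k


def tucker2_params(c_in: int, c_out: int, k: int, r_in: int, r_out: int) -> int:
    """Weights after an assumed Tucker-2 factorization:
    1x1 conv (c_in -> r_in), k x k core conv (r_in -> r_out),
    1x1 conv (r_out -> c_out). Bias ignored.
    """
    return c_in * r_in + r_in * r_out * k * k + r_out * c_out


# Hypothetical layer: 64 -> 64 channels, 3x3 kernel, both ranks set to 16.
dense = conv_params(64, 64, 3)
factored = tucker2_params(64, 64, 3, r_in=16, r_out=16)
ratio = dense / factored

print(f"dense: {dense}, factored: {factored}, reduction: {ratio:.2f}x")
```

A reduction in parameter count does not translate one-to-one into a reduction in inference time, which also depends on hardware, memory traffic, and how well the smaller layers are scheduled, so this count should not be read as a reproduction of the threefold speedup reported above.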

Files

Thesis_Anjali_Anand.pdf

(pdf | 0 Mb)

License info not available

File under embargo until 14-07-2027