Efficient Inference of Quantized LLMs on Edge Devices

Master Thesis (2025)
Author(s)

Z. Wang (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Q. Wang – Mentor (TU Delft - Embedded Systems)

R. Zhu – Mentor (TU Delft - Embedded Systems)

J. Yang – Graduation committee member (TU Delft - Web Information Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
28-11-2025
Awarding Institution
Delft University of Technology
Programme
['Electrical Engineering | Embedded Systems']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Files

TUD_MSc_Thesis.pdf
(pdf | 0 Mb)
License info not available
warning

File under embargo until 28-11-2027