Compressing YOLOv7

Master Thesis (2023)
Author(s)

B. van Zwienen (TU Delft - Mechanical Engineering)

Contributor(s)

Martijn Wisse – Mentor (TU Delft - Robot Dynamics)

Holger Caesar – Graduation committee member (TU Delft - Intelligent Vehicles)

C. Zhang – Graduation committee member (TU Delft - Learning & Autonomous Control)

Faculty
Mechanical Engineering
Copyright
© 2023 Benjamin van Zwienen
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Benjamin van Zwienen
Graduation Date
08-12-2023
Awarding Institution
Delft University of Technology
Programme
Mechanical Engineering
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In the literature, neural network compression can significantly reduce the number of floating-point operations (FLOPs) of a neural network with limited accuracy loss. At the same time, it is common to manually design smaller networks instead of using modern compression techniques. This thesis will compare the two approaches for the object detection network YOLOv7. YOLOv7 can run in real time on a desktop GPU. For edge GPUs a smaller version, called YOLOv7-tiny, was manually designed by the authors of YOLOv7. This thesis answers the question: Can a state-of-the-art compression of YOLOv7 achieve higher accuracy than YOLOv7-tiny at the same number of floating-point operations?
First, two state-of-the-art compression methods are selected and compared on YOLOv7-tiny. Then the best performing method, GBIP, is used to compress YOLOv7 till it has the same number of FLOPs as YOLOv7-tiny. From the experiments it is determined that GBIP is not able to achieve higher accuracy than YOLOv7-tiny at the same number of FLOPs.

Files

Thesis.pdf
(pdf | 2.57 Mb)
License info not available