Robotic Packaging Optimization with Reinforcement Learning and Real-World Data

Master Thesis (2023)
Author(s)

E.A. Drijver (TU Delft - Mechanical Engineering)

Contributor(s)

C. Lieu – Mentor (TU Delft - Learning & Autonomous Control)

Rodrigo Perez-Dattari – Mentor (TU Delft - Learning & Autonomous Control)

Zlatan Ajanovic – Mentor (TU Delft - Learning & Autonomous Control)

J. Kober – Graduation committee member (TU Delft - Learning & Autonomous Control)

M. Hulscher – Mentor (BluePrint Automation )

Faculty
Mechanical Engineering
Copyright
© 2023 Eveline Drijver
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Eveline Drijver
Graduation Date
09-03-2023
Awarding Institution
Delft University of Technology
Programme
['Mechanical Engineering | Biomechanical Design - BioRobotics']
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Intelligent manufacturing has become increasingly important in the food packaging industry due to the growing demand for enhanced productivity and flexibility while minimizing waste and lead times. This work explores the integration of such manufacturing in automated secondary robotic food packaging solutions that transfer food products into containers using pick-and-place robots. A major problem in these solutions is varying product supply caused by prior machinery. As a result productivity drops drastically when conveyor belt speeds are not optimally controlled. Conventional heuristic-based engineered approaches are used to address this issue but are inadequate, leading to noncompliance with industry's requirements. Reinforcement learning, on the other hand, has the potential of solving this problem by learning quick and predictive decision-making behavior based on experience. However, the lack of research in reinforcement learning for complex industrial robotic problems, limits its adoption in industry. Therefore, this work aims to investigate the feasibility of reinforcement learning in the robotic packaging industry. We propose a reinforcement learning framework, with policy inference in a highly complex control scheme, designed to optimize the conveyor belt speed of the secondary robotic packaging solution using real-world product supply data. The framework exceeds the 99.8 percent performance requirement and maintains quality at the required 100 percent when tested on real-world data. Compared to the current heuristic-based solution, our proposed framework improves productivity, has smoother control and reduces code execution time.

Files

MSc_Thesis_Paper_Eveline_Drijv... (pdf)
(pdf | 30.4 Mb)
- Embargo expired in 02-03-2025
License info not available