Solving the Online 3D Bin Packing Problem with Graph-Based Reinforcement Learning

Master thesis (2024)

Authors

G. Corvi Mechanical Engineering

Contributors

C. Della Santina Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering (supervisor 1)

Ronald Poelman (supervisor 2)

M. Wisse Robot Dynamics - Mechanical, Maritime and Materials Engineering (supervisor 2)

Faculty

Mechanical Engineering

Reinforcement Learning Bin Packing Graph Neural Networks Warehouse automation

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:81c7d858-88f6-4c39-9a99-c09ec6128e08

Published Date

25-04-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical Engineering

Abstract

The rapidly growing volume of parcel shipments is straining transportation and logistics sectors, highlighting the need for innovative solutions to optimize packing and loading processes. The online bin packing problem (BPP), an NP-hard computational problem, finds practical applications in numerous sectors, including modern packaging and intelligent logistics. This study proposes a novel reinforcement learning (RL) approach to tackle the online 3D-BPP emphasizing applicability and versatility. The key innovation is the representation of the packing scene as a graph, enabling effective encoding of task-specific high-level features. This graph-based structure serves as the foundation for an RL agent designed to learn an optimal packing strategy through dynamic interaction with the environment. The proposed approach uniquely operates within the continuous domain, enhancing generalization across diverse packing tasks. Experimental evaluations in both simulated environments and a real-world setting demonstrate that the solution achieves state-of-the-art performance across multiple complex three-dimensional packing scenarios.

Files

ThesisReport_GiovanniCorvi_560... (.pdf)

(.pdf | 22 Mb)