Learning heuristics for a supply planner

Master thesis (2022)

Authors

K. Chong Electrical Engineering, Mathematics and Computer Science

Contributors

M.M. de Weerdt Algorithmics - (mentor)

K.S. Postek Discrete Mathematics and Optimization - (mentor)

E.A.T. Julien Discrete Mathematics and Optimization - (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:31823048-b211-42b6-9b07-51e5cb1a62d5

More Info

expand_more

Published Date

30-05-2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Supply planning is an NP-Hard problem that is often tackled when dealing with supply chain management. It is a problem with many variations but the core idea is to create a plan that resolves as much demand as possible.
There are different approaches in use to solve a planning problem, one of which is by using heuristics. Heuristics are used to get to a relatively good solution within some feasible amount of time compared to trying to get to an optimal solution. Expertise of the problem is often required to come up with good heuristics or to improve existing ones. When such an expert is not readily available alternatives need to be considered, one of which is machine learning.
The goal of this thesis is to create a machine learning model that can improve the heuristics used in an existing planning algorithm at Outperform. This heuristic decides upon a sequence in which products are included in the planning algorithm. Using a different sequence can lead to vastly different results for the algorithm both in quality and time required. Our focus will be on reducing the amount of time required for the algorithm by learning a heuristic that can provide better sequences.
To achieve this goal, two models are created and evaluated. These models use imitation learning to replicate the sequences that were the fastest. Due to the lack of an expert to provide the fastest sequences, an oracle is created that attempts to search for a sequence as fast as possible within some feasible amount of time. The trained models are evaluated on several supply chains provided by Outperform.

Files

MSc_Thesis.pdf

(.pdf | 3.1 Mb)