Learning variable selection rules for the branch-and-bound algorithm using reinforcement learning

Master's thesis (2020)

Authors

L.V. Scavuzzo Montana (Electrical Engineering, Mathematics and Computer Science)

Contributors

N. Yorke-Smith (Algorithmics), mentor

K.I. Aardal (Discrete Mathematics and Optimization), graduation committee member

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:e1c09189-0b8f-470f-be99-1e1cf04f805e

Published Date

20-01-2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Mixed Integer Linear Programming (MILP) is a generalization of classical linear programming in which some (or all) variables are restricted to integer values. Numerous real-world problems can be modeled as MILPs, such as production planning, scheduling, network design optimization and many more. Solving MILPs is, in general, NP-hard. State-of-the-art solvers use the branch-and-bound algorithm, an exact method that, in combination with a diverse mixture of heuristics, can tackle a fair range of practical problems. This algorithm sequentially partitions the search space using linear relaxations, thus creating a search tree. The exploration ends only when a solution, together with its proof of optimality, is found. The tree's size can vary dramatically depending on the approach that is used to create and explore it. One of the most influential decision-making strategies within the branch-and-bound algorithm is the branching rule, i.e., the criterion that is used to subdivide the search space. Currently, there is no mathematical understanding of this complex process. For this reason, all widely accepted branching rules are based on hand-crafted strategies which have been shown to perform well in practice. The work presented in this report is part of a blossoming line of research at the intersection of Combinatorial Optimization and Machine Learning. Specifically, we take further steps in the direction of branching rule discovery through machine learning techniques. In contrast to previously proposed methods, which relied on supervised learning, we take the novel approach of leveraging a Reinforcement Learning (RL) algorithm. Our goal is to achieve a data-driven acceleration of the tree search. In this thesis, we lay the fundamental groundwork for the integration of RL into the branch-and-bound process. Through the proposed model, we gain insights into the benefits and limitations of RL, while improving on the state-of-the-art branching rules for a particular class of instances.
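To make the role of the branching rule concrete, the following is a minimal illustrative sketch and not the thesis's implementation. It assumes a toy 0/1 knapsack problem, whose linear relaxation can be solved exactly by a greedy value-to-weight ordering, so no LP solver is needed. The function names (`lp_relaxation`, `branch_and_bound`, `most_fractional`, `first_unfixed`) and the depth-first node order are hypothetical choices made for this example only. The branching rule is passed in as a plain function, which is the slot a learned policy would occupy.

```python
def lp_relaxation(values, weights, capacity, fixed):
    """LP relaxation of a 0/1 knapsack with some variables fixed to 0 or 1.

    Assumes positive weights. Returns (upper_bound, x), or None if the
    fixed variables already exceed the capacity.
    """
    remaining = capacity - sum(weights[i] for i, v in fixed.items() if v == 1)
    if remaining < 0:
        return None
    x = dict(fixed)
    free = [i for i in range(len(values)) if i not in fixed]
    # Greedy by value/weight ratio is optimal for the relaxed knapsack.
    free.sort(key=lambda i: values[i] / weights[i], reverse=True)
    for i in free:
        if weights[i] <= remaining:
            x[i] = 1.0
            remaining -= weights[i]
        else:
            x[i] = remaining / weights[i]  # at most one fractional variable
            remaining = 0
    return sum(values[i] * x[i] for i in x), x


def branch_and_bound(values, weights, capacity, branching_rule):
    """Depth-first branch and bound; returns (best_value, nodes_explored)."""
    best_value = 0  # the empty knapsack is always feasible
    nodes = 0
    stack = [{}]  # each node is a dict of fixed variables
    while stack:
        fixed = stack.pop()
        nodes += 1
        relaxed = lp_relaxation(values, weights, capacity, fixed)
        if relaxed is None:
            continue  # prune: infeasible node
        bound, x = relaxed
        if bound <= best_value:
            continue  # prune: relaxation cannot beat the incumbent
        if all(v in (0, 1) for v in x.values()):
            best_value = bound  # integral relaxation: new incumbent
            continue
        j = branching_rule(x, fixed)  # choose the variable to branch on
        stack.append({**fixed, j: 0})
        stack.append({**fixed, j: 1})
    return best_value, nodes


def most_fractional(x, fixed):
    """Branch on the unfixed variable whose relaxed value is closest to 0.5."""
    return min((i for i in x if i not in fixed), key=lambda i: abs(x[i] - 0.5))


def first_unfixed(x, fixed):
    """Naive rule: branch on the lowest-index unfixed variable."""
    return min(i for i in x if i not in fixed)


if __name__ == "__main__":
    values, weights, capacity = [10, 13, 7, 8, 4], [3, 4, 2, 3, 1], 7
    for rule in (most_fractional, first_unfixed):
        best, nodes = branch_and_bound(values, weights, capacity, rule)
        print(rule.__name__, "optimum:", best, "nodes explored:", nodes)
```

Both rules return the same optimal value, since any valid branching rule preserves exactness; only the number of explored nodes may differ, which is the quantity a better branching rule aims to reduce.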

Files

MSc_thesis.pdf

(.pdf | 3.21 MB)

Embargo expired on 20-01-2021