Reinforcement Learning for the Knapsack Problem

Book chapter (2021)

Authors

J. Pierotti Discrete Mathematics and Optimization -

Maximilian Kronmueller

J. Alonso-Mora Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

J.T. van Essen Discrete Mathematics and Optimization -

J.W. Böhmer Algorithmics -

Research Group

Discrete Mathematics and Optimization () (TU Delft)

DOI: https://doi.org/10.1007/978-3-030-86286-2_1

Transformer Self-attention Knapsack problem Reinforcement learning End-to-end Multi-task DQN

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:50c89fbd-7270-49db-bd9c-0ef49efb66bc

Published Date

2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Delft Institute of Applied Mathematics

Research Group

Discrete Mathematics and Optimization

Abstract

Combinatorial optimization (CO) problems are at the heart of both practical and theoretical research. Due to their complexity, many problems cannot be solved via exact methods in reasonable time; hence, we resort to heuristic solution methods. In recent years, machine learning (ML) has brought immense benefits in many research areas, including heuristic solution methods for CO problems. Among ML methods, reinforcement learning (RL) seems to be the most promising method to find good solutions for CO problems. In this work, we investigate an RL framework, whose agent is based on self-attention, to achieve solutions for the knapsack problem, which is a CO problem. Our algorithm finds close to optimal solutions for instances up to one hundred items, which leads to conjecture that RL and self-attention may be major building blocks for future state-of-the-art heuristics for other CO problems.

Files

978_3_030_86286_2_1.pdf

(.pdf | 0.391 Mb)