Global synchromodal shipment matching problem with dynamic and stochastic travel times: a reinforcement learning approach

Abstract

Global synchromodal transportation involves the movement of container shipments between inland terminals located on different continents using ships, barges, trains, trucks, or any combination of these modes, through integrated planning at the network level. One of the challenges faced by global operators is matching accepted shipments with services in an integrated global synchromodal transport network with dynamic and stochastic travel times. Service travel times are unknown in advance and are revealed dynamically during the execution of transport plans, but stochastic information about travel times is assumed to be available. Matching decisions can be updated before shipments arrive at their destination terminals. The objective of the problem is to maximize the total profit over a given planning horizon, expressed as a combination of revenues, travel costs, transfer costs, storage costs, delay costs, and carbon tax. We propose a sequential decision process model to describe the problem. To address the curse of dimensionality, we develop a reinforcement learning approach that learns the value of matching a shipment with a service through simulations. Specifically, we adopt the Q-learning algorithm to update value function estimates and use the ϵ-greedy strategy to balance exploitation and exploration. Online decisions are then made based on the estimated value functions. The performance of the reinforcement learning approach is evaluated against a myopic approach that does not consider uncertainties and a stochastic approach that imposes chance constraints on feasible transshipments, both under a rolling horizon framework.
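
As a rough illustration of the learning mechanism described above, the sketch below implements tabular Q-learning with an ϵ-greedy policy for a generic shipment-service matching simulator. The state and action encodings, the simulator interface (reset, feasible_matches, step), and the hyperparameter values are assumptions introduced for illustration; they are not the model or settings used in the paper.

```python
import random
from collections import defaultdict

# Illustrative hyperparameters; the paper's actual settings are not reproduced here.
ALPHA = 0.1      # learning rate
GAMMA = 0.95     # discount factor
EPSILON = 0.1    # exploration probability of the epsilon-greedy policy

# Q[(state, action)] -> estimated value of matching a shipment with a service.
Q = defaultdict(float)


def epsilon_greedy(state, actions):
    """Pick a random shipment-service match with probability EPSILON,
    otherwise the match with the highest current Q-estimate."""
    if random.random() < EPSILON:
        return random.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])


def q_learning_update(state, action, reward, next_state, next_actions):
    """Standard Q-learning backup: move the estimate toward the observed
    reward (profit contribution) plus the discounted value of the best
    next match."""
    best_next = max((Q[(next_state, a)] for a in next_actions), default=0.0)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])


def train(env, episodes=1000, horizon=50):
    """Learn matching values by simulating the transport network with
    sampled travel times. `env` is a hypothetical simulator exposing
    reset(), feasible_matches(state), and step(action)."""
    for _ in range(episodes):
        state = env.reset()
        for _ in range(horizon):
            actions = env.feasible_matches(state)
            if not actions:
                break
            action = epsilon_greedy(state, actions)
            next_state, reward, done = env.step(action)
            q_learning_update(state, action, reward, next_state,
                              env.feasible_matches(next_state))
            state = next_state
            if done:
                break
```

In an online setting, the learned Q-values would then be queried greedily (effectively ϵ = 0) to select matches as travel-time realizations unfold, which corresponds to making online decisions based on the estimated value functions.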

Files

Guo2022_Article_GlobalSynchrom... (.pdf, 1.88 MB); embargo expired on 21-07-2022