Search results | TU Delft Repositories

document

An adaptive agent-based approach for instant delivery order dispatching: Incorporating task buffering and dynamic batching strategies

Lu, Miaojia (author), Yan, Xinyu (author), Sharif Azadeh, S. (author), Wang, P. (author)

The volume of instant delivery has witnessed a significant growth in recent years. Given the involvement of numerous heterogeneous stakeholders, instant delivery operations are inherently characterized by dynamics and uncertainties. This study introduces two order dispatching strategies, namely task buffering and dynamic batching, as...

journal article 2024

document

Generalized Model and Deep Reinforcement Learning-Based Evolutionary Method for Multitype Satellite Observation Scheduling

Song, Yanjie (author), Ou, Junwei (author), Pedrycz, Witold (author), Suganthan, Ponnuthurai Nagaratnam (author), Wang, X. (author), Xing, Lining (author), Zhang, Yue (author)

Multitype satellite observation, including optical observation satellites, synthetic aperture radar (SAR) satellites, and electromagnetic satellites, has become an important direction in integrated satellite applications due to its ability to cope with various complex situations. In the multitype satellite observation scheduling problem ...

journal article 2024

document

Synergetic-informed deep reinforcement learning for sustainable management of transportation networks with large action spaces

Lai, Li (author), Dong, You (author), Andriotis, C. (author), Wang, Aijun (author), Lei, Xiaoming (author)

Effective transportation network management systems should consider safety and sustainability objectives. Existing research on large-scale transportation network management often employs the assumption that bridges can be considered individually under these objectives. However, this simplification misses accurate system-level representations,...

journal article 2024

document

Distributed Actor-Critic Algorithms for Multiagent Reinforcement Learning Over Directed Graphs

Dai, Pengcheng (author), Yu, Wenwu (author), Wang, He (author), Baldi, S. (author)

Actor-critic (AC) cooperative multiagent reinforcement learning (MARL) over directed graphs is studied in this article. The goal of the agents in MARL is to maximize the globally averaged return in a distributed way, i.e., each agent can only exchange information with its neighboring agents. AC methods proposed in the literature require the...

journal article 2023

document

Towards Efficient Personalized Driver Behavior Modeling with Machine Unlearning

Song, Q. (author), Tan, Rui (author), Wang, J. (author)

Driver Behavior Modeling (DBM) aims to predict and model human driving behaviors, which is typically incorporated into the Advanced Driver Assistance System to enhance transportation safety and improve driving experience. Inverse reinforcement learning (IRL) is a prevailing DBM technique with the goal of modeling the driving policy by...

conference paper 2023

document

Online Optimization of Gear Shift and Velocity for Eco-Driving using Adaptive Dynamic Programming

Li, Guoqiang (author), Gorges, Daniel (author), Wang, M. (author)

In this paper a learning-based optimization method for online gear shift and velocity control is presented to reduce the fuel consumption and improve the driving comfort in a car-following process. The continuous traction force and the discrete gear shift are optimized jointly to improve both the powertrain operation and the longitudinal...

journal article 2022

document

A Looseness Detection Method for Railway Catenary Fasteners based on Reinforcement Learning Refined Localization

Zhong, Junping (author), Liu, Zhigang (author), Wang, H. (author), Liu, Wenqiang (author), Yang, Cheng (author), Han, Zhiwei (author), Nunez, Alfredo (author)

Brace sleeve (BS) fasteners, i.e., nut and bolt, are small components but play essential roles in fixing BS and cantilever in railway catenary system. They are commonly inspected by onboard cameras using computer vision to ensure the safety of railway operation. However, most BS fasteners cannot be directly localized because they are too...

journal article 2021

document

Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee

Han, Minghao (author), Tian, Yuan (author), Zhang, Lixian (author), Wang, J. (author), Pan, W. (author)

Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. Without using a mathematical model, an optimal controller can be learned from data evaluated by certain performance criteria through trial-and-error. However, the data-based learning approach is notorious for not guaranteeing stability, which is...

journal article 2021

document

Reinforcement Learning in Railway Timetable Rescheduling

Zhu, Y. (author), Wang, H. (author), Goverde, R.M.P. (author)

Real-time railway traffic management is important for the daily operations of railway systems. It predicts and resolves operational conflicts caused by events like excessive passenger boardings/alightings. Traditional optimization methods for this problem are restricted by the size of the problem instances. Therefore, this paper proposes a...

conference paper 2020

document

Action-driven Reinforcement Learning for Improving Localization of Brace Sleeve in Railway Catenary

Zhong, J. (author), Liu, Zhigang (author), Wang, H. (author), Liu, W. (author), Yang, Cheng (author), Nunez, Alfredo (author)

Brace Sleeve (BS) plays an essential role in connecting and fixing cantilevers of railway catenary systems. It needs to be monitored to ensure the safety of railway operations. In the literature, image processing techniques that can localize BSs from inspection images are proposed. However, the boxes produced by existing methods can contain...

conference paper 2020

document

A method of personalized driving decision for smart car based on deep reinforcement learning

Wang, X. (author), Wu, Chaozhong (author), Xue, J. (author), Chen, Z. (author)

To date, automatic driving technology has become a hotspot in academia. It is necessary to provide a personalization of automatic driving decision for each passenger. The purpose of this paper is to propose a self-learning method for personalized driving decisions. First, collect and analyze driving data from different drivers to set learning...

journal article 2020

document

Use of Affordances for Efficient Robot Learning

Wang, C. (author)

doctoral thesis 2017