Search results | TU Delft Repositories

document

Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

Bai, Chengchao (author), Yan, Peng (author), Piao, Haiyin (author), Pan, W. (author), Guo, Jifeng (author)

This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized...

journal article 2024

document

Automatic enhancement of vascular configuration for self-healing concrete through reinforcement learning approach

Wan, Z. (author), Xu, Y. (author), Chang, Z. (author), Liang, M. (author), Šavija, B. (author)

Vascular self-healing concrete (SHC) has great potential to mitigate the environmental impact of the construction industry by increasing the durability of structures. Designing concrete with high initial mechanical properties by searching a specific arrangement of vascular structure is of great importance. Herein, an automatic optimization...

journal article 2024

document

Generalized Model and Deep Reinforcement Learning-Based Evolutionary Method for Multitype Satellite Observation Scheduling

Song, Yanjie (author), Ou, Junwei (author), Pedrycz, Witold (author), Suganthan, Ponnuthurai Nagaratnam (author), Wang, X. (author), Xing, Lining (author), Zhang, Yue (author)

Multitype satellite observation, including optical observation satellites, synthetic aperture radar (SAR) satellites, and electromagnetic satellites, has become an important direction in integrated satellite applications due to its ability to cope with various complex situations. In the multitype satellite observation scheduling problem ...

journal article 2024

document

Optimization Strategies for System Architecting Problems

Valencia Ibañez, Santiago (author)

System architecting is one of the first stages of the engineering problem-solving process. Pivotal decisions regarding the system's overall configuration are taken in this phase. Consequently, decision support tools like system architecture optimization are needed to effectively assess the architectural design space. However, system architecture...

master thesis 2023

document

Online Optimization of Gear Shift and Velocity for Eco-Driving using Adaptive Dynamic Programming

Li, Guoqiang (author), Gorges, Daniel (author), Wang, M. (author)

In this paper a learning-based optimization method for online gear shift and velocity control is presented to reduce the fuel consumption and improve the driving comfort in a car-following process. The continuous traction force and the discrete gear shift are optimized jointly to improve both the powertrain operation and the longitudinal...

journal article 2022

document

Timeslot allocation for waiting list control: Tactical planning of orthopaedic surgeons at the Sint Maartenskliniek

van der Vlugt, Yanna (author)

Patients visiting a hospital for elective surgery often have multiple consultations with a surgeon before undergoing surgery. Hospitals discern between different types of consultations, and make a schedule allocating timeslots of outpatient department sessions to these different consultation types several weeks in advance. Changing the...

master thesis 2021

document

Machine Learning in Chemical Engineering: A Perspective

Schweidtmann, A.M. (author), Esche, Erik (author), Fischer, Asja (author), Kloft, Marius (author), Repke, Jens Uwe (author), Sager, Sebastian (author), Mitsos, Alexander (author)

The transformation of the chemical industry to renewable energy and feedstock supply requires new paradigms for the design of flexible plants, (bio-)catalysts, and functional materials. Recent breakthroughs in machine learning (ML) provide unique opportunities, but only joint interdisciplinary research between the ML and chemical engineering ...

review 2021

document

Fine-tuning deep RL with gradient-free optimization

de Bruin, T.D. (author), Kober, J. (author), Tuyls, Karl (author), Babuska, R. (author)

Deep reinforcement learning makes it possible to train control policies that map high-dimensional observations to actions. These methods typically use gradient-based optimization techniques to enable relatively efficient learning, but are notoriously sensitive to hyperparameter choices and do not have good convergence properties. Gradient...

journal article 2020

document

Resource-constrained Multi-agent Markov Decision Processes

de Nijs, F. (author)

Intelligent autonomous agents, designed to automate and simplify many aspects of our society, will increasingly be required to also interact with other agents autonomously. Where agents interact, they are likely to encounter resource constraints. For example, agents managing household appliances to optimize electricity usage might need to share...

doctoral thesis 2019

document

Policy derivation methods for critic-only reinforcement learning in continuous spaces

Alibekov, Eduard (author), Kubalik, Jiri (author), Babuska, R. (author)

This paper addresses the problem of deriving a policy from the value function in the context of critic-only reinforcement learning (RL) in continuous state and action spaces. With continuous-valued states, RL algorithms have to rely on a numerical approximator to represent the value function. Numerical approximation due to its nature virtually...

journal article 2018