Searched for: subject%3A%22Reinforcement%255C%252BLearning%22
(1 - 10 of 10)
document
Bai, Chengchao (author), Yan, Peng (author), Piao, Haiyin (author), Pan, W. (author), Guo, Jifeng (author)
This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized...
journal article 2024
document
Wan, Z. (author), Xu, Y. (author), Chang, Z. (author), Liang, M. (author), Šavija, B. (author)
Vascular self-healing concrete (SHC) has great potential to mitigate the environmental impact of the construction industry by increasing the durability of structures. Designing concrete with high initial mechanical properties by searching a specific arrangement of vascular structure is of great importance. Herein, an automatic optimization...
journal article 2024
document
Song, Yanjie (author), Ou, Junwei (author), Pedrycz, Witold (author), Suganthan, Ponnuthurai Nagaratnam (author), Wang, X. (author), Xing, Lining (author), Zhang, Yue (author)
Multitype satellite observation, including optical observation satellites, synthetic aperture radar (SAR) satellites, and electromagnetic satellites, has become an important direction in integrated satellite applications due to its ability to cope with various complex situations. In the multitype satellite observation scheduling problem ...
journal article 2024
document
Valencia Ibañez, Santiago (author)
System architecting is one of the first stages of the engineering problem-solving process. Pivotal decisions regarding the system's overall configuration are taken in this phase. Consequently, decision support tools like system architecture optimization are needed to effectively assess the architectural design space. However, system architecture...
master thesis 2023
document
Li, Guoqiang (author), Gorges, Daniel (author), Wang, M. (author)
In this paper a learning-based optimization method for online gear shift and velocity control is presented to reduce the fuel consumption and improve the driving comfort in a car-following process. The continuous traction force and the discrete gear shift are optimized jointly to improve both the powertrain operation and the longitudinal...
journal article 2022
document
van der Vlugt, Yanna (author)
Patients visiting a hospital for elective surgery often have multiple consultations with a surgeon before undergoing surgery. Hospitals discern between different types of consultations, and make a schedule allocating timeslots of outpatient department sessions to these different consultation types several weeks in advance. Changing the...
master thesis 2021
document
Schweidtmann, A.M. (author), Esche, Erik (author), Fischer, Asja (author), Kloft, Marius (author), Repke, Jens Uwe (author), Sager, Sebastian (author), Mitsos, Alexander (author)
The transformation of the chemical industry to renewable energy and feedstock supply requires new paradigms for the design of flexible plants, (bio-)catalysts, and functional materials. Recent breakthroughs in machine learning (ML) provide unique opportunities, but only joint interdisciplinary research between the ML and chemical engineering ...
review 2021
document
de Bruin, T.D. (author), Kober, J. (author), Tuyls, Karl (author), Babuska, R. (author)
Deep reinforcement learning makes it possible to train control policies that map high-dimensional observations to actions. These methods typically use gradient-based optimization techniques to enable relatively efficient learning, but are notoriously sensitive to hyperparameter choices and do not have good convergence properties. Gradient...
journal article 2020
document
de Nijs, F. (author)
Intelligent autonomous agents, designed to automate and simplify many aspects of our society, will increasingly be required to also interact with other agents autonomously. Where agents interact, they are likely to encounter resource constraints. For example, agents managing household appliances to optimize electricity usage might need to share...
doctoral thesis 2019
document
Alibekov, Eduard (author), Kubalik, Jiri (author), Babuska, R. (author)
This paper addresses the problem of deriving a policy from the value function in the context of critic-only reinforcement learning (RL) in continuous state and action spaces. With continuous-valued states, RL algorithms have to rely on a numerical approximator to represent the value function. Numerical approximation due to its nature virtually...
journal article 2018
Searched for: subject%3A%22Reinforcement%255C%252BLearning%22
(1 - 10 of 10)