Searched for: subject%3A%22Reinforcement%255C+Learning%22
(21 - 40 of 445)

Pages

document
Holman, Storm (author)
In response to the increasing challenges of Cyber Electromagnetic Activities (CEMA) in urban settings, characterized by dense electromagnetic (EM) signals and rising data traffic, this research introduces an Agent-Based Model (ABM) aimed at prioritizing critical signals. The primary goal of this research is to deploy a Unmanned Aerial Vehicle ...
master thesis 2024
document
Piccini, Pietro (author)
ncentive-based demand response (iDR) programs serve as important tools for distributed system operators (DSOs) to achieve a reduction in electricity demand during periods of grid overload. During these programs, participants can decide to curtail their consumption in exchange for financial incentives. Deciding the amount of curtailment for a...
master thesis 2024
document
Wan, Z. (author), Xu, Y. (author), Chang, Z. (author), Liang, M. (author), Šavija, B. (author)
Vascular self-healing concrete (SHC) has great potential to mitigate the environmental impact of the construction industry by increasing the durability of structures. Designing concrete with high initial mechanical properties by searching a specific arrangement of vascular structure is of great importance. Herein, an automatic optimization...
journal article 2024
document
He, K. (author), Shi, S. (author), van den Boom, A.J.J. (author), De Schutter, B.H.K. (author)
Approximate dynamic programming (ADP) faces challenges in dealing with constraints in control problems. Model predictive control (MPC) is, in comparison, well-known for its accommodation of constraints and stability guarantees, although its computation is sometimes prohibitive. This paper introduces an approach combining the two methodologies...
journal article 2024
document
Lu, Miaojia (author), Yan, Xinyu (author), Sharif Azadeh, S. (author), Wang, P. (author)
The volume of instant delivery has witnessed a significant growth in recent years. Given the involvement of numerous heterogeneous stakeholders, instant delivery operations are inherently characterized by dynamics and uncertainties. This study introduces two order dispatching strategies, namely task buffering and dynamic batching, as...
journal article 2024
document
Groot, D.J. (author), Ellerbroek, Joost (author), Hoekstra, J.M. (author)
Conventional Air Traffic Control is still predominantly being done by human Air Traffic Controllers, however, as the traffic density increases, the workload of the controllers increases as well. Especially for the area of unmanned aviation, driven by the rise in drones, having human controllers might become unfeasible. One of the methods that...
journal article 2024
document
Tseremoglou, I. (author), Santos, Bruno F. (author)
In the Condition-Based Maintenance (CBM) context, the definition of optimal maintenance plans for an aircraft fleet depends on an efficient integration of : (i) the probabilistic predictions of the health condition of the components and (ii) the stochastic arrival of the corrective maintenance tasks, together with consideration of the...
journal article 2024
document
Li, Siyue (author), Zhou, Shize (author), Xue, Yongqi (author), Fan, Wenjie (author), Cheng, Tong (author), Ji, Jinlun (author), Dai, Chenyang (author), Song, Wenqing (author), Gao, C. (author)
Network-on-Chip (NoC) is a scalable on-chip communication architecture for the NN accelerator, but with the increase in the number of nodes, the communication delay becomes higher. Applications such as machine learning have a certain resilience to noisy/erroneous transmitted data. Therefore, approximate communication becomes a promising solution...
journal article 2024
document
Song, Yanjie (author), Ou, Junwei (author), Pedrycz, Witold (author), Suganthan, Ponnuthurai Nagaratnam (author), Wang, X. (author), Xing, Lining (author), Zhang, Yue (author)
Multitype satellite observation, including optical observation satellites, synthetic aperture radar (SAR) satellites, and electromagnetic satellites, has become an important direction in integrated satellite applications due to its ability to cope with various complex situations. In the multitype satellite observation scheduling problem ...
journal article 2024
document
Cheng, Ji (author), Xue, Bo (author), Jiaxiang, Y. (author), Zhang, Qingfu (author)
Multi-objective Stochastic Linear bandit (MOSLB) plays a critical role in the sequential decision-making paradigm, however, most existing methods focus on the Pareto dominance among different objectives without considering any priority. In this paper, we study bandit algorithms under mixed Pareto-lexicographic orders, which can reflect...
journal article 2024
document
Yao, X. (author), Du, Zhaocheng (author), Sun, Zhanbo (author), Calvert, S.C. (author), ji, Ang (author)
Deep Reinforcement Learning (DRL) has made remarkable progress in autonomous vehicle decision-making and execution control to improve traffic performance. This paper introduces a DRL-based mechanism for cooperative lane changing in mixed traffic (CLCMT) for connected and automated vehicles (CAVs). The uncertainty of human-driven vehicles (HVs...
journal article 2024
document
Lai, Li (author), Dong, You (author), Andriotis, C. (author), Wang, Aijun (author), Lei, Xiaoming (author)
Effective transportation network management systems should consider safety and sustainability objectives. Existing research on large-scale transportation network management often employs the assumption that bridges can be considered individually under these objectives. However, this simplification misses accurate system-level representations,...
journal article 2024
document
Murti, Fahri Wisnu (author), Ali, Samad (author), Iosifidis, G. (author), Latva-aho, Matti (author)
Virtualized Radio Access Networks (vRANs) are fully configurable and can be implemented at a low cost over commodity platforms to enable network management flexibility. In this paper, a novel vRAN reconfiguration problem is formulated to jointly reconfigure the functional splits of the base stations (BSs), locations of the virtualized central...
journal article 2024
document
Dierikx, M. (author), Albers, N. (author), Scheltinga, Bouke (author), Brinkman, W.P. (author)
Goal-setting is commonly used in behavior change applications for physical activity. However, for goals to be effective, they need to be tailored to a user’s situation (e.g., motivation, progress). One way to obtain such goals is a collaborative process in which a healthcare professional and client set a goal together, thus making use of the...
conference paper 2024
document
Zhang, Zheng (author), Zhang, Dengyu (author), Zhang, Qingrui (author), Pan, W. (author), Hu, Tianjiang (author)
Integrating rule-based policies into reinforcement learning promises to improve data efficiency and generalization in cooperative pursuit problems. However, most implementations do not properly distinguish the influence of neighboring robots in observation embedding or inter-robot interaction rules, leading to information loss and inefficient...
journal article 2024
document
Ni, Y. (author), Knoop, V.L. (author), Kooij, J.F.P. (author), van Arem, B. (author)
A substantial number of vehicles nowadays are equipped with adaptive cruise control (ACC), which adjusts the vehicle speed automatically. However, experiments have found that commercial ACC systems which only detect the direct leader amplify the propagating disturbances in the platoon. This can cause severe traffic congestion when the number...
journal article 2024
document
Bai, Chengchao (author), Yan, Peng (author), Piao, Haiyin (author), Pan, W. (author), Guo, Jifeng (author)
This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized...
journal article 2024
document
Wei, Zeyong (author), Chen, Honghua (author), Nan, L. (author), Wang, Jun (author), Qin, Jing (author), Wei, Mingqiang (author)
Current point cloud denoising (PCD) models optimize single networks, trying to make their parameters adaptive to each point in a large pool of point clouds. Such a denoising network paradigm neglects that different points are often corrupted by different levels of noise and they may convey different geometric structures. Thus, the intricacy of...
journal article 2024
document
van der Spaa, L.F. (author), Kober, J. (author), Gienger, Michael (author)
The advent of collaborative robots allows humans and robots to cooperate in a direct and physical way. While this leads to amazing new opportunities to create novel robotics applications, it is challenging to make the collaboration intuitive for the human. From a system’s perspective, understanding the human intentions seems to be one...
journal article 2024
document
Xue, Chenbao (author), Cai, Han (author), Gehly, S. (author), Jah, Moriba (author), Zhang, Jingrui (author)
To ensure the secure operation of space assets, it is crucial to employ ground and/or space-based surveillance sensors to observe a diverse array of anthropogenic space objects (ASOs). This enables the monitoring of abnormal behavior and facilitates the timely identification of potential risks, thereby enabling the provision of continuous and...
review 2024
Searched for: subject%3A%22Reinforcement%255C+Learning%22
(21 - 40 of 445)

Pages