
Lubbers, Seymour (author)
Greenhouses allow production of crops that would otherwise be impossible, permitting more local, fresher and more nutrient-rich crop production. Efforts are taken to minimize societal harm due to energy and resource consumption by greenhouse production systems. One way to control such systems is by using model predictive control. Optimal crop yield...
master thesis 2023

Dai, Pengcheng (author), Yu, Wenwu (author), Wang, He (author), Baldi, S. (author)
Actor-critic (AC) cooperative multiagent reinforcement learning (MARL) over directed graphs is studied in this article. The goal of the agents in MARL is to maximize the globally averaged return in a distributed way, i.e., each agent can only exchange information with its neighboring agents. AC methods proposed in the literature require the...
journal article 2023

Zhou, Y. (author), Ho, H.W. (author)
Hierarchical Reinforcement Learning (HRL) provides an option to solve complex guidance and navigation problems with high-dimensional spaces, multiple objectives, and a large number of states and actions. The current HRL methods often use the same or similar reinforcement learning methods within one application so that multiple objectives can...
journal article 2022

Meyer, Johann (author)
Aircraft are complex systems with, in some cases, high-dimensional nonlinear interactions between control surfaces. When a failure occurs, adaptive flight control methods can be utilised to stabilise and make the aircraft controllable. Adaptive flight control methods, however, require accurate aerodynamic models where first-order continuity is...
master thesis 2021

Becker, Midas (author)
Being a safe and healthy alternative to polluting and space-inefficient motorised vehicles, cycling can strongly improve living conditions in urban areas. Idling in front of traffic lights is seen as one of the major inconveniences of commuting by bicycle. By giving personalised speed advice, the probability of catching a green light can...
master thesis 2021

Karagöz, Ridvan (author)
B-splines are basis functions for the spline function space and are extensively used in applications requiring function approximation. The generalization of B-splines to multiple dimensions is done through tensor products of their univariate basis functions. The number of basis functions and weights that define a multivariate B-spline surface,...
master thesis 2020

Dai, Pengcheng (author), Yu, Wenwu (author), Wen, Guanghui (author), Baldi, S. (author)
In this article, the dynamic economic dispatch (DED) problem for smart grid is solved under the assumption that no knowledge of the mathematical formulation of the actual generation cost functions is available. The objective of the DED problem is to find the optimal power output of each unit at each time so as to minimize the total generation...
journal article 2020

Dorscheidt, Joost (author)
Reinforcement Learning (RL) is a learning paradigm that learns by interacting with the environment. In practice, an RL agent needs to perform many actions to sample rewards and state transitions from its environment. Recent advances in using deep neural networks as function approximators reduce the sample complexity in very high dimensional...
master thesis 2018

Buşoniu, Lucian (author), de Bruin, T.D. (author), Tolić, Domagoj (author), Kober, J. (author), Palunko, Ivana (author)
Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain. This review mainly covers artificial-intelligence approaches to RL, from the viewpoint of the control engineer. We explain how approximate representations of the...
review 2018

Langenkamp, W.H. (author)
Reinforcement learning is a machine learning paradigm that deals with optimisation and learns by interacting with its environment. Tabular reinforcement learning methods are popular because of their relative simplicity combined with good guarantees of finding an optimal solution. The downside is that they suffer from an exponentially growing...
master thesis 2016

De Visser, C.C. (author), Chu, Q.P. (author), Mulder, J.A. (author)
The ability to perform online model identification for nonlinear systems with unknown dynamics is essential to any adaptive model-based control system. In this paper, a new differential equality constrained recursive least squares estimator for multivariate simplex splines is presented that is able to perform online model identification and...
journal article 2011