Mean Field Multi Agent Reinforcement Learning for Active Wake Control

Author: Plămădeală, Ion (TU Delft Electrical Engineering, Mathematics and Computer Science)
Contributor: Neustroev, G. (mentor); de Weerdt, M.M. (mentor); Pawełczak, Przemysław (graduation committee)
Degree granting institution: Delft University of Technology
Corporate name: Delft University of Technology
Programme: Computer Science and Engineering
Project: CSE3000 Research Project
Date: 2023-06-30

Abstract:
The wake effect, the turbulence created behind a wind turbine as it extracts energy, negatively impacts the power output of downstream turbines. Active Wake Control can mitigate this effect by rotating some turbines away from the wind. Previous research applied single-agent reinforcement learning to Active Wake Control, showing good results for small-scale layouts, but the approach does not scale to larger, practical wind farms. To that end, this study focuses on the application of mean-field multi-agent reinforcement learning to Active Wake Control under constant wind conditions. This algorithm restricts the computations to a limited set of neighbouring turbines, reducing their complexity. To assess this application, I also study:
1. how to model the rewards to solve the lazy-agent problem, leveraging the nature of Active Wake Control
2. how the view of the agent changes the results
3. how the algorithm compares to a single-agent reinforcement learning algorithm, TD3
The experiments were done using the Floris Wake Simulator, with each turbine sharing the same agent, placed in tunnel layouts at real-life distances (6-7 rotor diameters), under constant wind conditions. Results show that with the proper configuration of rewards and view space within wind tunnels, the mean-field algorithm finds near-optimal configurations for Active Wake Control within a small number of episodes. This is a promising start for the application of mean-field multi-agent algorithms to the Active Wake Control problem, and it provides insight into how to model the rewards, which might be applicable to the whole class of algorithms.

Subject: Mean-Field Multi Agent Reinforcement Learning; Active Wake Control; Reward function; Observation view
To reference this document use: http://resolver.tudelft.nl/uuid:5cf1f1a6-e306-4c1c-9a9b-f4c2de104065
Part of collection: Student theses
Document type: bachelor thesis
Rights: © 2023 Ion Plămădeală
Files: plamadeala_thesis.pdf (PDF, 470.66 KB)