Optimal Energy Scheduling of Flexible Industrial Prosumers via Reinforcement Learning
Nick van den Bovenkamp (Student, TU Delft; Sunrock Investments B.V.)
Juan S. Giraldo (TNO)
Edgar Mauricio Salazar (Eindhoven University of Technology)
Pedro P. Vergara Barrios (TU Delft, Intelligent Electrical Power Grids)
Charalambos Konstantinou (King Abdullah University of Science and Technology, KAUST)
P. Palensky (TU Delft, Electrical Sustainable Energy)
Abstract
This paper introduces an energy management system (EMS) that aims to minimize electricity operating costs using reinforcement learning (RL) with linear function approximation. The proposed EMS uses a Q-learning with tile coding (QLTC) algorithm and is compared against a deterministic mixed-integer linear programming (MILP) formulation with perfect forecast information. The comparison is performed in a case study of an industrial manufacturing company in the Netherlands, using measured electricity consumption, PV generation, and wholesale electricity prices over one week of operation. The results show that the proposed EMS can adjust the prosumer's power consumption in response to favorable prices. The electricity costs obtained with the QLTC algorithm are within 1% of those obtained with the MILP model. Furthermore, the results demonstrate that the QLTC model can generalize from previously learned control policies even when data are missing, and can deploy actions that achieve 80% of the MILP's optimal solution.
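The abstract names the core technique, Q-learning with a linear value function over tile-coded features, without detail. The sketch below illustrates that combination in generic form: the TileCoder and QLTCAgent classes, the two-dimensional state (e.g., hour of day and wholesale price), the discrete load-level actions, and all hyperparameters are assumptions made for illustration, not the authors' implementation.

```python
# Minimal sketch of Q-learning with tile coding (QLTC) using a linear
# function approximation. Names, state variables, and hyperparameters
# are illustrative assumptions, not the paper's actual model.
import numpy as np

class TileCoder:
    """Maps a continuous state to sparse binary features via offset tilings."""
    def __init__(self, n_tilings, n_tiles, low, high, seed=0):
        self.n_tilings = n_tilings
        self.n_tiles = n_tiles
        self.low = np.asarray(low, dtype=float)
        self.high = np.asarray(high, dtype=float)
        self.dim = len(low)
        # Each tiling is shifted by a random fraction of one tile width.
        rng = np.random.default_rng(seed)
        self.offsets = rng.uniform(0.0, 1.0, size=(n_tilings, self.dim))
        self.n_features = n_tilings * n_tiles ** self.dim

    def features(self, state):
        """Return the indices of the active tiles (one per tiling)."""
        scaled = (np.asarray(state, dtype=float) - self.low) / (self.high - self.low)
        scaled = np.clip(scaled, 0.0, 1.0 - 1e-9) * self.n_tiles
        active = []
        for t in range(self.n_tilings):
            coords = np.floor(scaled + self.offsets[t]).astype(int) % self.n_tiles
            flat = int(np.ravel_multi_index(coords, (self.n_tiles,) * self.dim))
            active.append(t * self.n_tiles ** self.dim + flat)
        return np.array(active)

class QLTCAgent:
    """Q-learning with a linear Q-function over tile-coded features."""
    def __init__(self, coder, n_actions, alpha=0.1, gamma=0.99, epsilon=0.1):
        self.coder = coder
        self.n_actions = n_actions
        self.alpha = alpha / coder.n_tilings  # step size shared across active tiles
        self.gamma = gamma
        self.epsilon = epsilon
        self.w = np.zeros((n_actions, coder.n_features))  # linear weights

    def q(self, state):
        phi = self.coder.features(state)
        return self.w[:, phi].sum(axis=1)  # Q(s, a) for all actions

    def act(self, state):
        if np.random.rand() < self.epsilon:      # epsilon-greedy exploration
            return np.random.randint(self.n_actions)
        return int(np.argmax(self.q(state)))

    def update(self, state, action, reward, next_state, done):
        phi = self.coder.features(state)
        target = reward if done else reward + self.gamma * self.q(next_state).max()
        td_error = target - self.w[action, phi].sum()
        self.w[action, phi] += self.alpha * td_error  # gradient step on active tiles

# Toy setup: state = (hour of day, wholesale price in EUR/MWh),
# action = one of three discrete load levels.
coder = TileCoder(n_tilings=8, n_tiles=10, low=[0.0, -50.0], high=[24.0, 300.0])
agent = QLTCAgent(coder, n_actions=3)
```

Because the tilings partition disjoint index ranges, each update touches only `n_tilings` weights, which keeps the linear approximation cheap and lets a trained agent interpolate over states it has not visited, consistent with the generalization behavior the abstract reports.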