Residential demand response of thermostatically controlled loads using batch Reinforcement Learning

Journal Article (2017)
Research Group
Team Bart De Schutter
Copyright
© 2017 F Ruelens, BJ Claessens, S Vandael, B.H.K. De Schutter, R. Babuska, R Belmans
DOI related publication
https://doi.org/10.1109/TSG.2016.2517211
More Info
expand_more
Publication Year
2017
Language
English
Copyright
© 2017 F Ruelens, BJ Claessens, S Vandael, B.H.K. De Schutter, R. Babuska, R Belmans
Research Group
Team Bart De Schutter
Issue number
5
Volume number
8
Pages (from-to)
2149-2159
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.

Files

07401112_1_3.pdf
(pdf | 1.38 Mb)
License info not available