Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning

Journal Article (2022)

Author(s)

Ksenija Stepanovic (TU Delft - Algorithmics)

Jichen Wu (Flex Technologies, TU Delft - Algorithmics)

Rob Everhardt (Flex Technologies)

Mathijs de Weerdt (TU Delft - Algorithmics)

Research Group

Algorithmics

DOI related publication

https://doi.org/10.3390/en15093290

Markov decision process; Q-learning; 4th generation district heating; Combined heat and power economic dispatch; Mixed-integer nonlinear program; Pipeline energy storage

To reference this document use:

https://resolver.tudelft.nl/uuid:64dca9cc-df52-4a15-a0c8-b53538c619fa

Publication Year

2022

Language

English

Abstract

Integrating pipeline energy storage into the control of a district heating system can increase profit, for example by adjusting the electricity production of a combined heat and power (CHP) unit to the fluctuating electricity price. Environmental uncertainty, the computational complexity of an accurate model, and the scarcity of sensors in a district heating system make the operational use of pipeline energy storage challenging. The vast majority of previous work derives a control strategy by decomposing a mixed-integer nonlinear model and applying significant simplifications. To mitigate the resulting stability, feasibility, and computational complexity challenges, we model CHP economic dispatch as a Markov decision process. We use a reinforcement learning (RL) algorithm to estimate the system’s dynamics through interactions with the simulation environment. The RL approach is compared with a detailed nonlinear mathematical optimizer on day-ahead and real-time electricity markets and on two district heating grid models. The proposed method achieves moderate profit, which is affected by environment stochasticity. The advantages of the RL approach show in three aspects: stability, feasibility, and time scale flexibility. We conclude that RL is a promising alternative for real-time control of complex, nonlinear industrial systems.
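To illustrate the idea of casting dispatch as a Markov decision process and learning a policy by interaction, the following is a minimal tabular Q-learning sketch on a toy problem. It is not the authors' model or simulator: the two-level price signal, the three-level storage, the production cost, the infeasibility penalty, and all function names (`step`, `train`, `greedy_action`) are hypothetical choices made only for this example.

```python
import random

# All constants below are hypothetical toy values, not taken from the paper.
PRICES = (1.0, 5.0)   # low / high electricity price, alternating every step
COST = 3.0            # production cost per unit of output
DEMAND = 1            # heat units consumed per step
MAX_STORAGE = 2       # storage (standing in for the pipeline) holds 0..2 units
ACTIONS = (0, 1, 2)   # units produced by the CHP in one step
PENALTY = 10.0        # charged when storage would under- or overflow


def step(state, action):
    """Toy environment: return (next_state, reward) for one dispatch step."""
    price_idx, storage = state
    reward = (PRICES[price_idx] - COST) * action
    next_storage = storage + action - DEMAND
    if next_storage < 0 or next_storage > MAX_STORAGE:
        # Infeasible dispatch: penalize and clip to the valid range.
        reward -= PENALTY
        next_storage = min(max(next_storage, 0), MAX_STORAGE)
    # The price deterministically flips between low and high.
    return (1 - price_idx, next_storage), reward


def train(episodes=2000, horizon=24, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {((p, s), a): 0.0
         for p in (0, 1) for s in range(MAX_STORAGE + 1) for a in ACTIONS}
    for _ in range(episodes):
        state = (rng.randrange(2), rng.randrange(MAX_STORAGE + 1))
        for _ in range(horizon):
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            next_state, reward = step(state, action)
            best_next = max(q[(next_state, a)] for a in ACTIONS)
            # Standard Q-learning update toward the bootstrapped target.
            q[(state, action)] += alpha * (
                reward + gamma * best_next - q[(state, action)]
            )
            state = next_state
    return q


def greedy_action(q, state):
    """Action with the highest learned value in the given state."""
    return max(ACTIONS, key=lambda a: q[(state, a)])
```

In this toy setting the learned policy tends to produce at full output when the price is high and to draw down storage when the price is low, which mirrors, in a very simplified form, the kind of price-following flexibility the abstract describes.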

Files

Energies_15_03290.pdf

(pdf | 0.893 Mb)