Physics-informed distributed reinforcement learning for privacy-aware voltage regulation using local smart meter data

Journal Article (2026)
Author(s)

Dong Liu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Juan S. Giraldo (Netherlands Organisation for Applied Scientific Research)

Peter Palensky (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Pedro P. Vergara (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Intelligent Electrical Power Grids
DOI related publication
https://doi.org/10.1016/j.egyai.2026.100768 Final published version
More Info
expand_more
Publication Year
2026
Language
English
Research Group
Intelligent Electrical Power Grids
Journal title
Energy and AI
Volume number
24
Article number
100768
Downloads counter
7
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Centralized reinforcement learning-based voltage regulation in distribution networks is becoming increasingly difficult due to the growing penetration of distributed energy resources, high computational burden, repeated power flow calculations, and increasing privacy concerns. This paper proposes a physics-informed fully distributed reinforcement learning framework that enables autonomous voltage regulation using only local smart meter data. A Thevenin-equivalent-based local voltage estimation model and a hybrid correction mechanism are developed to support accurate local decision-making without synchronized global measurements or centralized power flow solvers. A lightweight coordination mechanism is further introduced to refine the actions of independently trained local agents. Case studies show that the proposed framework reduces voltage violations by approximately 80%, achieves performance close to that of power flow-based training environments, and achieves a training speedup of about 6×[jls-end-space/]. The results also indicate that the relaxation factors in the reward function and the coordination scaler are critical to voltage regulation efficiency, whereas the discount factor has a smaller impact. These findings demonstrate the practicality of the proposed framework for privacy-aware fully distributed voltage regulation.