Efficient optimal power flow learning: A deep reinforcement learning with physics-driven critic model

Ahmed Sayed; Khaled Al Jaafari; Xian Zhang; Hatem Zeineldin; Ahmed Al-Durra; Guibin Wang; Ehab Elsaadany

doi:10.1016/j.ijepes.2025.110621

International Journal of Electrical Power & Energy Systems (Jun 2025)

Affiliations

Ahmed Sayed (corresponding author): Electrical and Computer Engineering, Khalifa University, Abu Dhabi, 127788, United Arab Emirates (corresponding address); Faculty of Engineering, Cairo University, Giza, 12613, Egypt
Khaled Al Jaafari: Electrical and Computer Engineering, Khalifa University, Abu Dhabi, 127788, United Arab Emirates
Xian Zhang: Mechanical Engineering and Automation, Harbin Institute of Technology, Shenzhen, 518055, China
Hatem Zeineldin: Electrical and Computer Engineering, Khalifa University, Abu Dhabi, 127788, United Arab Emirates; Faculty of Engineering, Cairo University, Giza, 12613, Egypt
Ahmed Al-Durra: Electrical and Computer Engineering, Khalifa University, Abu Dhabi, 127788, United Arab Emirates
Guibin Wang (corresponding author): Mechatronics and Control Engineering, Shenzhen University, Shenzhen, 518060, China
Ehab Elsaadany: Electrical and Computer Engineering, Khalifa University, Abu Dhabi, 127788, United Arab Emirates

DOI: https://doi.org/10.1016/j.ijepes.2025.110621
Journal volume & issue: Vol. 167, p. 110621

Abstract

The transition to decarbonized energy systems presents significant operational challenges due to increased uncertainties and complex dynamics. Deep reinforcement learning (DRL) has emerged as a powerful tool for optimizing power system operations. However, most existing DRL approaches rely on approximated data-driven critic networks, requiring numerous risky interactions to explore the environment and often facing estimation errors. To address these limitations, this paper proposes an efficient DRL algorithm with a physics-driven critic model, namely a differentiable holomorphic embedding load flow model (D-HELM). This approach enables accurate policy gradient computation through a differentiable loss function based on system states of realized uncertainties, simplifying both the replay buffer and the learning process. By leveraging continuation power flow principles, D-HELM ensures operable, feasible solutions while accelerating gradient steps through simple matrix operations. Simulation results across various test systems demonstrate the computational superiority of the proposed approach, outperforming state-of-the-art DRL algorithms during training and model-based solvers in online operations. This work represents a potential breakthrough in real-time energy system operations, with extensions to security-constrained decision-making, voltage control, unit commitment, and multi-energy systems.
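
The core idea in the abstract, obtaining the policy gradient by differentiating a loss through a physics model instead of through a learned critic network, can be illustrated with a deliberately tiny sketch. The code below is not the authors' D-HELM and does not solve a power flow: it assumes a hypothetical single-bus system with two generators, quadratic costs with made-up coefficients `C1` and `C2`, and a one-parameter linear policy. The names `physics_cost_and_grad` and `train` are illustrative only. The "physics" is just the power balance that forces the slack generator to cover the remaining demand, and the gradient with respect to the policy parameter follows from the chain rule.

```python
import random

# Hypothetical quadratic cost coefficients for two generators (assumed values).
C1, C2 = 1.0, 3.0


def physics_cost_and_grad(g1, demand):
    """Toy physics model for a single bus with generator 2 as the slack.

    Power balance fixes g2 = demand - g1, so the cost is a differentiable
    function of the action g1. Returns (cost, d(cost)/d(g1)).
    """
    g2 = demand - g1
    cost = C1 * g1 ** 2 + C2 * g2 ** 2
    dcost_dg1 = 2.0 * C1 * g1 - 2.0 * C2 * g2
    return cost, dcost_dg1


def train(steps=200, batch=16, lr=0.05, seed=0):
    """Fit a linear policy g1 = theta * demand by gradient descent.

    The gradient is computed exactly through the physics model; no learned
    critic and no stored transitions beyond the sampled demands are used.
    """
    rng = random.Random(seed)
    theta = 0.0
    for _ in range(steps):
        grad = 0.0
        for _ in range(batch):
            d = rng.uniform(0.5, 1.5)      # realized uncertain demand
            g1 = theta * d                 # policy action
            _, dcost_dg1 = physics_cost_and_grad(g1, d)
            grad += dcost_dg1 * d          # chain rule: d(g1)/d(theta) = d
        theta -= lr * grad / batch
    return theta


theta = train()
print(f"learned share: {theta:.4f}, analytic optimum: {C2 / (C1 + C2):.4f}")
```

In this toy setting the learned share of demand assigned to generator 1 approaches the closed-form optimum `C2 / (C1 + C2)`, because the gradient supplied by the physics model is exact. A realistic version would replace `physics_cost_and_grad` with a differentiable load flow solver and the scalar policy with a neural network.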

Published in International Journal of Electrical Power & Energy Systems

ISSN: 0142-0615 (Print); 1879-3517 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Production of electric energy or power. Powerplants. Central stations
Website: https://www.sciencedirect.com/journal/international-journal-of-electrical-power-and-energy-systems
