Skip to main navigation Skip to search Skip to main content

Pre-Trained Actor-Guided Proximal Policy Optimization for Feasible and Safe AC Optimal Power Flow in Large-Scale Power Systems

  • Liangcai Zhou
  • , Pan Wu
  • , Rui Chen
  • , Xin Chen
  • State Grid Corporation of China
  • Xi'an Jiaotong University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Real-time AC Optimal Power Flow (AC-OPF) optimization poses a critical challenge in modern power systems, due to the computational complexity of traditional solvers and the infeasibility issues associated with the reinforcement learning (RL) methods. We propose Pre-trained Actor-Guided Proximal Policy Optimization (PAG-PPO). This framework initializes an RL agent with the actor neural network cloning from historical OPF solutions to enhance learning efficiency. This approach enables rapid convergence to physically feasible and numerically stable control policies, ensuring consistent power flow convergence while minimizing feasible action exploration in the training. Evaluated on the IEEE 300-bus system under diverse load conditions and line outages, PAG-PPO achieves power flow convergence in 500 episodes. Compared to the MATPOWER solver, PAG-PPO reduces inference time by 114×, with a 0.79% increase in generation cost and a 99.6% feasibility rate. Infeasible cases involve minor voltage violations, with no power flow divergence. These results demonstrate that PAG-PPO offers a practical, scalable, and near-optimal solution for real-time AC-OPF optimization, advancing RL-based decision-making for large-scale power systems.

Original languageEnglish
Title of host publication2025 IEEE 9th Conference on Energy Internet and Energy System Integration, EI2 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages850-855
Number of pages6
ISBN (Electronic)9798331548599
DOIs
StatePublished - 2025
Event2025 IEEE 9th Conference on Energy Internet and Energy System Integration, EI2 2025 - Jilin, China
Duration: 5 Dec 20258 Dec 2025

Publication series

Name2025 IEEE 9th Conference on Energy Internet and Energy System Integration, EI2 2025

Conference

Conference2025 IEEE 9th Conference on Energy Internet and Energy System Integration, EI2 2025
Country/TerritoryChina
CityJilin
Period5/12/258/12/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • AC optimal power flow
  • proximal policy optimization (PPO)
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Pre-Trained Actor-Guided Proximal Policy Optimization for Feasible and Safe AC Optimal Power Flow in Large-Scale Power Systems'. Together they form a unique fingerprint.

Cite this