Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (10): 2511-2521.doi: 10.16182/j.issn1004731x.joss.25-0529

• Simulation Technology for New Power System and Integrated Energy System • Previous Articles    

Optimization Dispatch Method for High-proportion Renewable Energy Power Systems Based on SC-PPO

Xu Zhongkai1, Chu Chenyang1, Xie Kai1, Zhao Ruizhuo2, Ke Wenjun3   

  1. 1.NR Electric Co. , Ltd. , Nanjing 211102, China
    2.Beijing Institute of Computer Technology and Application, Beijing 100854, China
    3.School of Computer Science and Engineering, Southeast University, Nanjing 211189, China
  • Received:2025-06-09 Revised:2025-09-11 Online:2025-10-20 Published:2025-10-21

Abstract:

The high proportion of renewable energy integration brings significant challenges of randomness, multi-objective coupling, and security constraints to power systems. Traditional model-driven methods have limitations in modeling accuracy and adaptability. To address these issues, this paper proposed a safety-constrained PPO algorithm (SC-PPO). The method included three improvements. A temporal convolutional network was utilized to construct a dynamic state encoder that integrated historical operation, real-time monitoring, and prediction data to form a causal state representation. A hierarchical reward structure was designed, and an adaptive weighting mechanism based on constraint satisfaction degree was introduced to coordinate multi-objective optimization. Physical constraint projection operators were embedded in the policy output layer, transforming constraints such as unit ramp rates, energy storage state of charge, and voltage magnitudes into feasible region mappings in the action space. Simulation results show that SC-PPO reduces voltage limit violations by 75% while improving wind power accommodation rate to 95.6% and reducing carbon emissions to 15 220 t, the research provides a new paradigm of intelligent decision-making that combines adaptability and security for high renewable energy penetration power systems.

Key words: deep reinforcement learning, power system dispatch, dynamic state representation, hierarchical reward mechanism, security constraint embedding

CLC Number: