Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (11): 2877-2887.doi: 10.16182/j.issn1004731x.joss.24-0604

• Papers • Previous Articles    

Evolutionary Reinforcement Learning Based on Elite Instruction and Random Search

Di Jian1,2, Wan Xue1, Jiang Limei1,3   

  1. 1.Department of Computer, North China Electric Power University (Baoding), Baoding 071003, China
    2.Hebei Key Lab of Knowledge Computing for Energy & Power, Baoding 071003, China
    3.Engineering Research Center of Intelligent Computing for Complex Energy Systems, Ministry of Education, Baoding 071003, China
  • Received:2024-06-04 Revised:2024-07-24 Online:2025-11-18 Published:2025-11-27
  • Contact: Jiang Limei

Abstract:

Evolutionary reinforcement learning currently suffers from low sample efficiency, a single coupling method, and poor convergence, which can affect its performance and scaling. To address this issue, an improved algorithm based on elite gradient instruction and double random search was proposed. The direction of the reinforcement strategy gradient update was corrected by introducing elite strategy gradient guidance carrying evolutionary information during reinforcement strategy training. Double stochastic search was used to replace the original evolutionary component, reducing the complexity of the algorithm while making the policy search meaningful and controllable in the parameter space. The introduction of complete replacement information trading effectively balanced the learning and search of reinforcement and evolutionary strategies. Experimental results show that the method has improved exploration power, robustness, and convergence compared to the classical evolutionary reinforcement learning method.

Key words: evolutionary reinforcement learning, deep reinforcement learning, evolutionary algorithm, continuous control, elite gradient instruction

CLC Number: