Research on PAC-Bayes-Based A2C Algorithm for Multi-objective Reinforcement Learning
Liu Xiang, Jin Qiankun
Journal of System Simulation. 2025, (12): 3212-3223. DOI: 10.16182/j.issn1004731x.joss.25-FZ0691