基于PAC-Bayes的多目标强化学习A2C算法研究
刘翔, 金乾坤
Research on PAC-Bayes-Based A2C Algorithm for Multi-objective Reinforcement Learning
Liu Xiang, Jin Qiankun
系统仿真学报 . 2025, (12): 3212 -3223 .  DOI: 10.16182/j.issn1004731x.joss.25-FZ0691