Research on Policy Representation in Deep Reinforcement Learning (深度强化学习中策略表征研究简述)
Chen Zhen (陈真), Wu Zhuoyi (吴卓屹), Zhang Lin (张霖)
Journal of System Simulation (系统仿真学报). 2025, (7): 1753-1769. DOI: 10.16182/j.issn1004731x.joss.25-0533