稀疏奖励下多航天器规避决策自学习仿真
赵毓, 郭继峰, 颜鹏, 白成超
Self-learning-based Multiple Spacecraft Evasion Decision Making Simulation Under Sparse Reward Condition
Zhao Yu, Guo Jifeng, Yan Peng, Bai Chengchao
系统仿真学报 . 2021, (8): 1766 -1774 .  DOI: 10.16182/j.issn1004731x.joss.21-0432