强化学习驱动的海战场多智能体协同作战仿真算法 |
| 石鼎, 燕雪峰, 宫丽娜, 张静宣, 关东海, 魏明强 |
|
Multi-agent Cooperative Combat Simulation in Naval Battlefield with Reinforcement Learning |
| Ding Shi, Xuefeng Yan, Lina Gong, Jingxuan Zhang, Donghai Guan, Mingqiang Wei |
| 图7 红方战斗机回合平均奖励 |
| Fig. 7 Episode average rewards of red fighter plane |
|