强化学习驱动的海战场多智能体协同作战仿真算法 |
石鼎, 燕雪峰, 宫丽娜, 张静宣, 关东海, 魏明强 |
Multi-agent Cooperative Combat Simulation in Naval Battlefield with Reinforcement Learning |
Ding Shi, Xuefeng Yan, Lina Gong, Jingxuan Zhang, Donghai Guan, Mingqiang Wei |
图7 红方战斗机回合平均奖励 |
Fig. 7 Episode average rewards of red fighter plane |
![]() |