Multi-agent Reinforcement Learning Method for Wargame Simulation Based on Suboptimal Demonstration Guidance
Zhou Zicong, Zeng Junjie, Hu Yue, Zhu Zhengqiu, Yin Quanjun
Journal of System Simulation . 2026, (5): 1277 -1289 .  DOI: 10.16182/j.issn1004731x.joss.25-0743