系统仿真学报 ›› 2026, Vol. 38 ›› Issue (4): 988-1003.doi: 10.16182/j.issn1004731x.joss.24-0922

• 论文 • 上一篇    下一篇

云原生仿真驱动的智能竞赛平台与模式

秦龙1, 黄鹤松1, 尹路珈1, 艾川1, 张琪1, 李新梦2   

  1. 1.国防科技大学 系统工程学院,湖南 长沙 410073
    2.湖南先进技术研究院,湖南 长沙 410072
  • 收稿日期:2024-08-21 修回日期:2024-10-11 出版日期:2026-04-20 发布日期:2026-04-22
  • 通讯作者: 黄鹤松
  • 第一作者简介:秦龙(1984-),男,副研究员,博士,研究方向为军事建模与仿真。
  • 基金资助:
    湖南省自然科学基金(2024JJ6478)

Intelligent Competition Platform and Mode Driven by Cloud-native Simulation

Qin Long1, Huang Hesong1, Yin Lujia1, Ai Chuan1, Zhang Qi1, Li Xinmeng2   

  1. 1.College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
    2.Hunan Institute of Advanced Technology, Changsha 410072, China
  • Received:2024-08-21 Revised:2024-10-11 Online:2026-04-20 Published:2026-04-22
  • Contact: Huang Hesong

摘要:

为解决智能体对抗竞赛模式存在开发部署困难、资源利用率低、可复用性差、难以接入强化学习算法等问题,设计了一种新型的智能体仿真训练平台。基于云原生技术解耦了竞赛平台软件组成要素;提出了一种面向竞赛环境的高性能仿真引擎;设计了一种新的智能控制端嵌入强化学习模型方法,内置多种在线策略、离线策略强化学习算法。实验表明:系统开发部署高效,降低了对参赛者硬件设备的要求,且参赛者不需要安装任何应用,一键登录系统;系统运行高效,通过云服务化设计,有效解决并发能力弱、可靠性差、响应延迟等问题;系统与智能训练算法适配度高并支持用户对训练、推演模块和主要参数进行调节,降低参赛者智能化训练门槛。

关键词: 互联网模式, 云原生, 强化学习, B/S架构, 组件化建模

Abstract:

To solve the problems faced by the adversarial competition mode of agents, including difficult development and deployment, low resource utilization, poor reusability, and difficulty in accessing reinforcement learning algorithms, a new agent simulation training platform was designed. The software components of the competition platform were decoupled based on cloud-native technology; a high-performance simulation engine for the competition environment was proposed; a new method of an embedded reinforcement learning model for an intelligent control terminal was designed, with multiple online and offline policy-based reinforcement learning algorithms set.The experiment demonstrates that the development and deployment of the system is efficient, which reduces the requirements for the hardware equipment of the participants, and the participants can log in to the system via one click without installing any applications. The system runs efficiently and effectively solves problems such as weak concurrency, poor reliability, and response delay through cloud service design. The system is highly adapted to the intelligent training algorithm and supports users to adjust the training, deduction modules, and main parameters, so as to reduce the threshold of intelligent training for participants.

Key words: internet mode, cloud-native, RL, B/S architecture, component-based modeling

中图分类号: