| [1] |
Li Shouyi, Chen Mou, Wang Yuhui, et al. Air Combat Decision-making of Multiple UCAVs Based on Constraint Strategy Games[J]. Defence Technology, 2022, 18(3): 368-383.
|
| [2] |
雍宇晨, 李子豫, 董琦. 基于分层多智能体强化学习的多无人机视距内空战[J]. 智能系统学报, 2025, 20(3): 548-556.
|
|
Yong Yuchen, Li Ziyu, Dong Qi. Multi-UAV Within-visual-range Air Combat Based on Hierarchical Multiagent Reinforcement Learning[J]. CAAI Transactions on Intelligent Systems, 2025, 20(3): 548-556.
|
| [3] |
Wu Mingxi. Intelligent Warfare: Prospects of Military Development in the Age of AI[M]. London: Routledge, 2022.
|
| [4] |
Murat Perit Çakır, Gürakar Gökhan. Towards Intelligent Flight Simulator Training[J]. The Journal of the JAPCC, 2023, 36: 46-53.
|
| [5] |
Jordan Javier. The Future of Unmanned Combat Aerial Vehicles: an Analysis Using the Three Horizons Framework[J]. Futures, 2021, 134: 102848.
|
| [6] |
梁晓龙, 杨爱武, 张佳强, 等. 无人集群博弈对抗系统仿真验证及决策关键技术综述[J]. 系统仿真学报, 2024, 36(4): 805-816.
|
|
Liang Xiaolong, Yang Aiwu, Zhang Jiaqiang, et al. Simulation Verification and Decision-making Key Technologies of Unmanned Swarm Game Confrontation: A Survey[J]. Journal of System Simulation, 2024, 36(4): 805-816.
|
| [7] |
Li Yuxi. Deep Reinforcement Learning: An Overview[EB/OL]. (2017-01-25) [2025-05-02]. .
|
| [8] |
Wang Xinwei, Wang Yihui, Su Xichao, et al. Deep Reinforcement Learning-based Air Combat Maneuver Decision-making: Literature Review, Implementation Tutorial and Future Direction[J]. Artificial Intelligence Review, 2024, 57(1): 1.
|
| [9] |
BENGIO Y, GOODFELLOW I, COURVILLE A. Deep Learning[M].Cambridge, Massachusetts: University Press of the Massachusetts Institute of Technology, 2017.
|
| [10] |
SUTTON R S, BARTO A G. Reinforcement Learning: An Introduction[M]. Cambridge, Massachusetts: University Press of the Massachusetts Institute of Technology, 2018.
|
| [11] |
Li Yurui, Chen Yuxuan, Zhang Li, et al. The Composite Task Challenge for Cooperative Multi-Agent Reinforcement Learning[EB/OL]. (2025-02-01) [2025-05-02]. .
|
| [12] |
施伟, 冯旸赫, 程光权, 等. 基于深度强化学习的多机协同空战方法研究[J]. 自动化学报, 2021, 47(7): 1610-1623.
|
|
Shi Wei, Feng Yanghe, Cheng Guangquan, et al. Research on Multi-aircraft Cooperative Air Combat Method Based on Deep Reinforcement Learning[J]. Acta Automatica Sinica, 2021, 47(7): 1610-1623.
|
| [13] |
Foerster J N, Farquhar G, Afouras T, et al. Counterfactual Multi-agent Policy Gradients[C]//Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence. Palo Alto: AAAI Press, 2018: 363.
|
| [14] |
Li Shaowei, Jia Yuhong, Yang Fan, et al. Collaborative Decision-making Method for Multi-UAV Based on Multiagent Reinforcement Learning[J]. IEEE Access, 2022, 10: 91385-91396.
|
| [15] |
Liu Xiaoxiong, Yin Yi, Su Yuzhan, et al. A Multi-UCAV Cooperative Decision-making Method Based on an MAPPO Algorithm for Beyond-visual-range Air Combat[J]. Aerospace, 2022, 9(10): 563.
|
| [16] |
Xiaohong Nian, Li Mengmeng, Wang Haibo, et al. Large-scale UAV Swarm Confrontation Based on Hierarchical Attention Actor-critic Algorithm[J]. Applied Intelligence, 2024, 54(4): 3279-3294.
|
| [17] |
符小卫, 王辉, 徐哲. 基于DE-MADDPG的多无人机协同追捕策略[J]. 航空学报, 2022, 43(5): 522-535.
|
|
Fu Xiaowei, Wang Hui, Xu Zhe. Cooperative Pursuit Strategy for Multi-UAVs Based on DE-MADDPG Algorithm[J]. Acta Aeronautica et Astronautica Sinica, 2022, 43(5): 522-535.
|
| [18] |
陈灿, 莫雳, 郑多, 等. 非对称机动能力多无人机智能协同攻防对抗[J]. 航空学报, 2020, 41(12): 336-348.
|
|
Chen Can, Mo Li, Zheng Duo, et al. Cooperative Attack-defense Game of Multiple UAVs with Asymmetric Maneuverability[J]. Acta Aeronautica et Astronautica Sinica, 2020, 41(12): 336-348.
|
| [19] |
孙智孝, 杨晟琦, 朴海音, 等. 未来智能空战发展综述[J]. 航空学报, 2021, 42(8): 28-42.
|
|
Sun Zhixiao, Yang Shengqi, Haiyin Piao, et al. A Survey of Air Combat Artificial Intelligence[J]. Acta Aeronautica et Astronautica Sinica, 2021, 42(8): 28-42.
|
| [20] |
TALAY T A. Introduction to the Aerodynamics of Flight[EB/OL]. (1975-01-01) [2025-05-02]. .
|
| [21] |
SHAW R L. Fighter Combat[M]. Annapolis, Maryland: Tactics and Maneuvering, 1985: 62-97.
|
| [22] |
Zheng Zhiqiang, Duan Haibin. UAV Maneuver Decision-making Via Deep Reinforcement Learning for Short-range Air Combat[J]. Intelligence & Robotics, 2023, 3(1): 76-94.
|
| [23] |
Yang Qiming, Zhang Jiandong, Shi Guoqing, et al. Maneuver Decision of UAV in Short-range Air Combat Based on Deep Reinforcement Learning[J]. IEEE Access, 2020, 8: 363-378.
|
| [24] |
Cho Kyunghyun, van Merriënboer Bart, Gulcehre Caglar, et al. Learning Phrase Representations Using RNN Encoder-decoder for Statistical Machine Translation[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Stroudsburg: ACL, 2014: 1724-1734.
|
| [25] |
SCHAUL T, Quan J, ANTONOGLOU I, et al. Prioritized Experience Replay[C]//4th International Conference on Learning Representations. Puerto Rico: ICLR, 2016: 1-13.
|
| [26] |
Sutton R S. Learning to Predict by the Methods of Temporal Differences[J]. Machine Learning, 1988, 3(1): 9-44.
|
| [27] |
KINGMA D P, Ba J. Adam: A Method for Stochastic Optimization[C]//3rd International Conference on Learning Representations. San Diego: ICLR 2015: 1-15.
|
| [28] |
Lowe Ryan, Wu Yi, Tamar A, et al. Multi-agent Actor-critic for Mixed Cooperative-competitive Environments[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2017: 6382-6393.
|