Journal of System Simulation ›› 2023, Vol. 35 ›› Issue (10): 2249-2261.doi: 10.16182/j.issn1004731x.joss.23-FZ0824
• Papers • Previous Articles Next Articles
Wang Yukun(), Wang Ze, Dong Liwei, Li Ni(
)
Received:
2023-07-03
Revised:
2023-09-15
Online:
2023-10-30
Published:
2023-10-26
Contact:
Li Ni
E-mail:wyk_13@foxmail.com;lini@buaa.edu.cn
CLC Number:
Wang Yukun, Wang Ze, Dong Liwei, Li Ni. Research on Multi-aircraft Air Combat Behavior Modeling Based on Hierarchical Intelligent Modeling Methods[J]. Journal of System Simulation, 2023, 35(10): 2249-2261.
1 | Holcomb S D, Porter W K, Ault S V, et al. Overview on DeepMind and Its AlphaGo Zero AI[C]//Proceedings of the 2018 International Conference on Big Data and Education. New York, NY, USA: Association for Computing Machinery, 2018: 67-71. |
2 | Arulkumaran K, Cully A, Togelius J. AlphaStar: an Evolutionary Computation Perspective[C]//Proceedings of the Genetic and Evolutionary Computation Conference Companion. New York, NY, USA: Association for Computing Machinery, 2019: 314-315. |
3 | Berner C, Brockman G, Chan B, et al. Dota 2 With Large Scale Deep Reinforcement Learning[EB/OL]. (2019-12-13) [2023-05-10]. . |
4 | 杨惟轶, 白辰甲, 蔡超, 等. 深度强化学习中稀疏奖励问题研究综述[J]. 计算机科学, 2020, 47(3): 182-191. |
Yang Weiyi, Bai Chenjia, Cai Chao, et al. Survey on Sparse Reward in Deep Reinforcement Learning[J]. Computer Science, 2020, 47(3): 182-191. | |
5 | Chen G. A New Framework for Multi-agent Reinforcement Learning-centralized Training and Exploration With Decentralized Execution via Policy Distillation[EB/OL]. (2019-10-21) [2022-11-02]. . |
6 | Lowe R, Wu Yi, Tamar A, et al. Multi-agent Actor-critic for Mixed Cooperative-competitive Environments[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook, NY, USA: Curran Associates Inc., 2017: 6382-6393. |
7 | Marthi B. Automatic Shaping and Decomposition of Reward Functions[C]//Proceedings of the 24th International Conference on Machine learning. New York, NY, USA: Association for Computing Machinery, 2007: 601-608. |
8 | Chen Jiayu, Zhang Yuanxin, Xu Yuanfan, et al. Variational Automatic Curriculum Learning for Sparse-reward Cooperative Multi-agent Problems[C]//35th Conference on Neural Information Processing Systems. Red Hook, NY, USA: Curran Associates Inc., 2021, 34: 9681-9693. |
9 | Hu Yujing, Wang Weixun, Jia Hangtian, et al. Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping[C]//34th Conference on Neural Information Processing Systems. Red Hook, NY, USA: Curran Associates Inc., 2020: 15931-15941. |
10 | Zhelo O, Zhang Jingwei, Tai Lei, et al. Curiosity-driven Exploration for Mapless Navigation With Deep Reinforcement Learning[EB/OL]. (2018-05-14) [2023-06-21]. . |
11 | Wang Xin, Chen Yudong, Zhu Wenwu. A Survey on Curriculum Learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(9): 4555-4576. |
12 | Hutsebaut-Buysse M, Mets K, Latré S. Hierarchical Reinforcement Learning: A Survey and Open Research Challenges[J]. Machine Learning & Knowledge Extraction, 2022, 4(1): 172-221. |
13 | 周攀, 黄江涛, 章胜, 等. 基于深度强化学习的智能空战决策与仿真[J]. 航空学报, 2023, 44(4): 94-107. |
Zhou Pan, Huang Jiangtao, Zhang Sheng, et al. Intelligent Air Combat Decision Making and Simulation Based on Deep Reinforcement Learning[J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(4): 94-107. | |
14 | 李永丰, 史静平, 章卫国, 等. 深度强化学习的无人作战飞机空战机动决策[J]. 哈尔滨工业大学学报, 2021, 53(12): 33-41. |
Li Yongfeng, Shi Jingping, Zhang Weiguo, et al. Maneuver Decision of UCAV in Air Combat Based on Deep Reinforcement Learning[J]. Journal of Harbin Institute of Technology, 2021, 53(12): 33-41. | |
15 | Wang Zhuang, Li Hui, Wu Haolin, et al. Improving Maneuver Strategy in Air Combat by Alternate Freeze Games With a Deep Reinforcement Learning Algorithm[J]. Mathematical Problems in Engineering, 2020, 2020: 7180639. |
16 | 章胜, 杜昕, 肖娟, 等. 基于深度强化学习的固定翼飞行器六自由度飞行智能控制[J]. 指挥与控制学报, 2022, 8(2): 179-188. |
Zhang Sheng, Du Xin, Xiao Juan, et al. Fixed-wing Aircraft 6-DOF Flight Control Based on Deep Reinforcement Learning[J]. Journal of Command and Control, 2022, 8(2): 179-188. | |
17 | 孙智孝, 杨晟琦, 朴海音, 等. 未来智能空战发展综述[J]. 航空学报, 2021, 42(8): 28-42. |
Sun Zhixiao, Yang Shengqi, Haiyin Piao, et al. A Survey of Air Combat Artificial Intelligence[J]. Acta Aeronautica et Astronautica Sinica, 2021, 42(8): 28-42. | |
18 | Tashev B, Purcell M, McLaughlin B. Russia's Information Warfare: Exploring the Cognitive Dimension[J]. MCU Journal, 2019, 10(2): 129-147. |
19 | Hamfelt A, Karlsson M, Thierfelder T, et al. Beyond K-means: Clusters Identification for GIS[M]//Popovich V V, Claramunt C, Devogele T, et al. Information Fusion and Geographic Information Systems: Towards the Digital Ocean. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011: 93-105. |
20 | Rashid T, Farquhar G, Peng Bei, et al. Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-agent Reinforcement Learning[C]//34th Conference on Neural Information Processing Systems. Red Hook, NY, USA: Curran Associates Inc., 2020: 10199-10210. |
[1] | Ding Shi, Xuefeng Yan, Lina Gong, Jingxuan Zhang, Donghai Guan, Mingqiang Wei. Multi-agent Cooperative Combat Simulation in Naval Battlefield with Reinforcement Learning [J]. Journal of System Simulation, 2023, 35(4): 786-796. |
[2] | Ying Xu, Shuai Zhang, Zhige Xie, Xinhai Xu, Manhui Sun, Ning Guo. A Simulation Method of Airborne Radar Real-time Detection Based on Three-dimensional Subdivision [J]. Journal of System Simulation, 2023, 35(2): 268-276. |
[3] | Baiyuan Ding, Fuling Mu, Yunpeng Li, Zhongkuan Chen, Chengyu Liu. Design of System Combat Simulation Platform for Complex Electromagnetic Environment [J]. Journal of System Simulation, 2023, 35(2): 330-338. |
[4] | Zheng Yang, Zhimin Xiang, Shiwen Ma. A Method of Loose Coupling Entity Modeling Based on Variable Rules [J]. Journal of System Simulation, 2022, 34(7): 1506-1511. |
[5] | Lingjia Ni, Xiaoxia Huang, Hongga Li, Zibo Zhang. Research on Fire Emergency Evacuation Simulation Based on Cooperative Deep Reinforcement Learning [J]. Journal of System Simulation, 2022, 34(6): 1353-1366. |
[6] | Xin Zhou, Weiping Wang, Yifan Zhu, Tao Wang, Tian Jing. An Unmanned Swarm Search Method Based on Human-Robot Cooperation [J]. Journal of System Simulation, 2022, 34(4): 735-744. |
[7] | Qi Xiaolong, Yang Xuguang. Time-varying Output Formation Tracking Control of Discrete-time Heterogeneous Multi-agent Systems [J]. Journal of System Simulation, 2022, 34(1): 36-44. |
[8] | Gui Xindong, Ji Hongjiang, Fan Lingling, Liu Shida. Application of Trust Driven Adaptive Cooperative Control Algorithm [J]. Journal of System Simulation, 2021, 33(8): 1809-1817. |
[9] | Xie Xu, Qiu Xiaogang, Duan Hong, Huang Kedi. Research on Combat Simulation Body of Knowledge [J]. Journal of System Simulation, 2021, 33(4): 773-780. |
[10] | Lei Yonglin, Zhu Zhi, Gan bin, Lei Sen, Chen Yong. Combat Effectiveness Simulation Evaluation Framework of Complex Weapon System [J]. Journal of System Simulation, 2020, 32(9): 1654-1663. |
[11] | Wu Wei. Research on Dynamic Editable Method for Combat Simulation Model Combination [J]. Journal of System Simulation, 2020, 32(5): 967-974. |
[12] | Liang Wei, Huang Yanyan, Wang Jianyu. Modeling Method of Emergency Response based on Internal and External Areas of Disaster Zone [J]. Journal of System Simulation, 2020, 32(4): 669-677. |
[13] | Yu Ying, Liang Weidong, Zhu Xiujuan, Wang Jianlin, Wang Ling. Design of Combat Simulation and Visualization System of Hypersonic Vehicle [J]. Journal of System Simulation, 2019, 31(12): 2584-2590. |
[14] | Fan Rui, Tan Yaxin, Huang Junqing. Analysis of Reconnaissance System Modeling and Simulation in Combat Simulation [J]. Journal of System Simulation, 2018, 30(9): 3480-3483. |
[15] | Hou Qihao, Yao Yiping, Cao Xiang. Research on Battle Damage Assessment Method of Aggregating Target in Theater-Level [J]. Journal of System Simulation, 2018, 30(12): 4580-4586. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||