Journal of System Simulation ›› 2026, Vol. 38 ›› Issue (2): 372-386.doi: 10.16182/j.issn1004731x.joss.25-0486
• Machine Learning Algorithms • Previous Articles
Wang Bingkun1, Wang Yue1, Yang Mei2, Zhang Pengnian1, Fan Bohao1, Tang Jie1
Received:2025-05-28
Revised:2025-07-29
Online:2026-02-18
Published:2026-02-11
Contact:
Wang Yue
CLC Number:
Wang Bingkun, Wang Yue, Yang Mei, Zhang Pengnian, Fan Bohao, Tang Jie. Strike Strategy Planning Method of Unmanned Ground Vehicles Based on Improved PPO Algorithm[J]. Journal of System Simulation, 2026, 38(2): 372-386.
| [1] | Wang Tong, Fu Liyue, Wei Zhengxian, et al. Unmanned Ground Weapon Target Assignment Based on Deep Q-learning Network with an Improved Multi-objective Artificial Bee Colony Algorithm[J]. Engineering Applications of Artificial Intelligence, 2023, 117, Part B: 105612. |
| [2] | Jia Yingjuan, Qu Liangdong, Li Xiaoqin. Automatic Path Planning of Unmanned Combat Aerial Vehicle Based on Double-layer Coding Method with Enhanced Grey Wolf Optimizer[J]. Artificial Intelligence Review, 2023, 56(10): 12257-12314. |
| [3] | Ahn Jisoo, Jung Sewoong, Kim Hansom, et al. A Study on Unmanned Combat Vehicle Path Planning for Collision Avoidance with Enemy Forces in Dynamic Situations[J]. Journal of Computational Design and Engineering, 2023, 10(6): 2251-2270. |
| [4] | 王霄龙, 陈洋, 胡棉, 等. 基于改进深度Q网络的机器人持续监测路径规划[J]. 兵工学报, 2024, 45(6): 1813-1823. |
| Wang Xiaolong, Chen Yang, Hu Mian, et al. Robot Path Planning for Persistent Monitoring Based on Improved Deep Q Networks[J]. Acta Armamentarii, 2024, 45(6): 1813-1823. | |
| [5] | Qu Liangdong, Jia Yingjuan, Li Xiaoqin, et al. Two-stage Control Model Based on Enhanced Elephant Clan Optimization for Path Planning of Unmanned Combat Aerial Vehicle[J]. The Journal of Supercomputing, 2024, 80(17): 24938-24974. |
| [6] | Zhang Haojie, Yang Tiantian, Su Zhibao. A Formation Cooperative Reconnaissance Strategy for Multi-UGVs in Partially Unknown Environment[J]. Journal of the Chinese Institute of Engineers, 2023, 46(6): 551-562. |
| [7] | Su Wenjia, Gao Min, Gao Xinbao, et al. An Online Attack Decision Method for Unmanned Aerial Vehicle Cluster in Uncertain Environments[J]. IEEE Sensors Journal, 2024, 24(11): 18457-18466. |
| [8] | 李传浩, 明振军, 王国新, 等. 基于多智能体深度强化学习的无人平台箔条干扰末端防御动态决策方法[J]. 兵工学报, 2025, 46(3): 19-33. |
| Li Chuanhao, Ming Zhenjun, Wang Guoxin, et al. Dynamic Decision-making Method of Unmanned Platform Chaff Jamming for Terminal Defense Based on Multi-agent Deep Reinforcement Learning[J]. Acta Armamentarii, 2025, 46(3): 19-33. | |
| [9] | Wang Ting, Deng Yuxiang, Yang Zhao, et al. Parameterized Deep Reinforcement Learning with Hybrid Action Space for Edge Task Offloading[J]. IEEE Internet of Things Journal, 2024, 11(6): 10754-10767. |
| [10] | 张森, 代强强. 改进型深度确定性策略梯度的无人机路径规划[J]. 系统仿真学报, 2025, 37(4): 875-881. |
| Zhang Sen, Dai Qiangqiang. UAV Path Planning Based on Improved Deep Deterministic Policy Gradients[J]. Journal of System Simulation, 2025, 37(4): 875-881. | |
| [11] | 张建东, 王鼎涵, 杨啟明, 等. 基于分层强化学习的无人机空战多维决策[J]. 兵工学报, 2023, 44(6): 1547-1563. |
| Zhang Jiandong, Wang Dinghan, Yang Qiming, et al. Multi-dimensional Decision-making for UAV Air Combat Based on Hierarchical Reinforcement Learning[J]. Acta Armamentarii, 2023, 44(6): 1547-1563. | |
| [12] | 李超, 王瑞星, 黄建忠, 等. 稀疏奖励下基于强化学习的无人集群自主决策与智能协同[J]. 兵工学报, 2023, 44(6): 1537-1546. |
| Li Chao, Wang Ruixing, Huang Jianzhong, et al. Autonomous Decision-making and Intelligent Collaboration of UAV Swarms Based on Reinforcement Learning with Sparse Rewards[J]. Acta Armamentarii, 2023, 44(6): 1537-1546. | |
| [13] | Ma Chengdong, Liu Jianan, He Saichao, et al. Confrontation and Obstacle-avoidance of Unmanned Vehicles Based on Progressive Reinforcement Learning[J]. IEEE Access, 2023, 11: 50398-50411. |
| [14] | Yue Longfei, Yang Rennong, Zhang Ying, et al. Deep Reinforcement Learning for UAV Intelligent Mission Planning[J]. Complexity, 2022, 2022: 3551508. |
| [15] | Liu Wei, Zhang Tao, Huang Shengjun, et al. A Hybrid Optimization Framework for UAV Reconnaissance Mission Planning[J]. Computers & Industrial Engineering, 2022, 173: 108653. |
| [16] | Xiong Jiechao, Wang Qing, Yang Zhuoran, et al. Parametrized Deep Q-networks Learning: Reinforcement Learning with Discrete-continuous Hybrid Action Space[EB/OL]. (2018-10-10) [2025-04-01]. . |
| [17] | Fan Zhou, Su Rui, Zhang Weinan, et al. Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2019: 2279-2285. |
| [18] | Cao Jingyu, Dong Lu, Sun Changyin. Hierarchical Reinforcement Learning for Kinematic Control Tasks with Parameterized Action Spaces[J]. Neural Computing and Applications, 2024, 36(1): 323-336. |
| [19] | Han Guangjie, Feng Zixiao, Wang Hao, et al. Underwater Multi-target Node Path Planning in Hybrid Action Space: A Deep Reinforcement Learning Approach[J]. IEEE Transactions on Mobile Computing, 2024, 23(12): 13033-13047. |
| [20] | Xu Yahao, Wei Yiran, Jiang Keyang, et al. Action Decoupled SAC Reinforcement Learning with Discrete-continuous Hybrid Action Spaces[J]. Neurocomputing, 2023, 537: 141-151. |
| [21] | He Yufei, Hu Ruiqi, Liang Kewei, et al. Deep Reinforcement Learning Algorithm with Long Short-term Memory Network for Optimizing Unmanned Aerial Vehicle Information Transmission[J]. Mathematics, 2025, 13(1): 46. |
| [22] | Zhang Cheng, Tao Chengyang, Xu Yuelei, et al. Autonomous Defense of Unmanned Aerial Vehicles Against Missile Attacks Using a GRU-based PPO Algorithm[J]. International Journal of Aeronautical and Space Sciences, 2024, 25(3): 1034-1049. |
| [23] | Hao Shuhui, Guan Wei, Cui Zhewen, et al. USV Collision Avoidance Decision-making Based on the Improved PPO Algorithm in Restricted Waters[J]. Journal of Marine Science and Engineering, 2024, 12(8): 1428. |
| [24] | Schulman J, Wolski F, Dhariwal P, et al. Proximal Policy Optimization Algorithms[EB/OL]. (2017-08-28) [2025-02-10]. . |
| [25] | 姜凌峰, 李新凯, 张海, 等. 基于改进TD3算法的无人机动态环境无地图导航[J]. 航空学报, 2025, 46(8): 292-307. |
| Jiang Lingfeng, Li Xinkai, Zhang Hai, et al. Mapless Navigation of UAVs in Dynamic Environments Based on an Improved TD3 Algorithm[J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(8): 292-307. |
| [1] | Wu Shuxia, Zhang Junjie, Chen Delong, Chen Zheyi. Resource-efficient Continuous Learning Framework for Edge Real-time Video Analytics [J]. Journal of System Simulation, 2026, 38(2): 294-306. |
| [2] | Zhu Ling, Li Jing, Zhang Zhaohui. An Adaptive Robot Path Planning Based on Improved REA* Algorithm [J]. Journal of System Simulation, 2026, 38(2): 332-345. |
| [3] | Yu Yiran, Lai Huicheng, Gao Guxue, Zhang Guo, Peng Wangyinan, Yang Longfei, Huang Junhao. Optimization Method for Multi Agricultural Machinery Collaborative Operation Based on Genetic Algorithm and A * Algorithm [J]. Journal of System Simulation, 2025, 37(9): 2397-2408. |
| [4] | Ni Peilong, Mao Pengjun, Wang Ning, Yang Mengjie. Robot Path Planning Based on Improved A-DDQN Algorithm [J]. Journal of System Simulation, 2025, 37(9): 2420-2430. |
| [5] | Gong Feng, Jiang Tao, Zhang Qin, Liu Yu. Simulation and Optimization of Support Processes for Aircraft Fleet Launch Under Limited Resources [J]. Journal of System Simulation, 2025, 37(8): 1965-1977. |
| [6] | Zhang Kaixiang, Mao Jianlin, Wang Niya, Xu Zhihao. Multi-robot Hierarchical Collaborative k-robust Path Planning for Path Interference [J]. Journal of System Simulation, 2025, 37(8): 2074-2088. |
| [7] | Zhang Yue, Zhang Wenliang, Feng Qiang, Guo Xing, Ren Yi, Wang Zili. Combat-oriented Comprehensive Simulation and Verification Technology for Equipment System RMS [J]. Journal of System Simulation, 2025, 37(7): 1823-1835. |
| [8] | Wan Yuhang, Zhu Zilu, Zhong Chunfu, Liu Yongkui, Lin Tingyu, Zhang Lin. Dynamic Path Planning for Robotic Arms Based on an Improved PPO Algorithm [J]. Journal of System Simulation, 2025, 37(6): 1462-1473. |
| [9] | Ye Chen, Shao Peng, Zhang Shaoping, Li Wenting, Zhou Tengming. Enhanced Artificial Gorilla Algorithm for Mobile Robot Path Planning [J]. Journal of System Simulation, 2025, 37(6): 1474-1485. |
| [10] | Zhang Yan, Li Binghua, Huo Tao, Liu Rong. Research on Robot Dynamic Obstacle Avoidance Method Based on Improved A* and Dynamic Window Algorithm [J]. Journal of System Simulation, 2025, 37(6): 1555-1564. |
| [11] | Wang Ziyi, Zhang Kai, Qian Dianwei, Liu Yuzhen. A DRL⁃based Approach for Distributed Equipment Nodes Selection [J]. Journal of System Simulation, 2025, 37(6): 1565-1573. |
| [12] | Wu Guohua, Zeng Jiaheng, Wang Dezhi, Zheng Long, Zou Wei. A Quadrotor Trajectory Tracking Control Method Based on Deep Reinforcement Learning [J]. Journal of System Simulation, 2025, 37(5): 1169-1187. |
| [13] | Zhou Xiaohui, Li Yanqiang, Wang Yong, Zhao Decai, Yang Xiaoyao. Robot Path Planning Based on Ant Colony Algorithm with Dual Heuristic Information [J]. Journal of System Simulation, 2025, 37(5): 1280-1289. |
| [14] | Yu Die, Bao Baizhong, Si Yan, Duan Jian, Zhan Xiaobin, Shi Tielin. Mobile Robot Path Planning Based on Search-step Optimized A* Algorithm [J]. Journal of System Simulation, 2025, 37(4): 1041-1050. |
| [15] | Zhang Sen, Dai Qiangqiang. UAV Path Planning Based on Improved Deep Deterministic Policy Gradients [J]. Journal of System Simulation, 2025, 37(4): 875-881. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||