系统仿真学报 ›› 2025, Vol. 37 ›› Issue (10): 2578-2593.doi: 10.16182/j.issn1004731x.joss.24-0494
• 论文 • 上一篇
梁秀满, 刘子良, 刘振东
收稿日期:2024-05-08
修回日期:2024-09-12
出版日期:2025-10-20
发布日期:2025-10-21
通讯作者:
刘子良
第一作者简介:梁秀满(1973-),女,副教授,硕导,硕士,研究方向为机器学习、强化学习等。
基金资助:Liang Xiuman, Liu Ziliang, Liu Zhendong
Received:2024-05-08
Revised:2024-09-12
Online:2025-10-20
Published:2025-10-21
Contact:
Liu Ziliang
摘要:
针对RRT算法在三维复杂场景中规划全局路径时存在规划效率低、安全性和实用性较差而无法满足无人机对飞行路径的安全需求,提出SAC深度强化学习算法与RRT算法融合的SAC-RRT算法。设计基于SAC算法决策网络的目标点偏置策略和动态步长策略,降低RRT盲目性;设计随机点修正过程,根据决策网络输出动作优化随机点位置,改善路径安全性;设计精简步骤和平滑步骤,进一步提高路径安全性。设计了不同复杂程度的三维场景,规划结果表明:SAC-RRT算法有效缩短了路径长度和规划时间,改善了路径的平滑性和安全性。
中图分类号:
梁秀满,刘子良,刘振东 . 基于深度强化学习的改进RRT算法路径规划[J]. 系统仿真学报, 2025, 37(10): 2578-2593.
Liang Xiuman,Liu Ziliang,Liu Zhendong . Path Planning of Improved RRT Algorithm Based on Deep Reinforcement Learning[J]. Journal of System Simulation, 2025, 37(10): 2578-2593.
| [1] | 陈锦涛, 李鸿一, 任鸿儒, 等. 基于RRT森林算法的高层消防多无人机室内协同路径规划[J]. 自动化学报, 2023, 49(12): 2615-2626. |
| Chen Jintao, Li Hongyi, Ren Hongru, et al. Cooperative Indoor Path Planning of Multi-UAVs for High-rise Fire Fighting Based on RRT-forest Algorithm[J]. Acta Automatica Sinica, 2023, 49(12): 2615-2626. | |
| [2] | Sun Yinghui, Fang Ming, Su Yixin. AGV Path Planning Based on Improved Dijkstra Algorithm[J]. Journal of Physics: Conference Series, 2021, 1746(1): 012052. |
| [3] | Zhang Jing, Wu Jun, Shen Xiao, et al. Autonomous Land Vehicle Path Planning Algorithm Based on Improved Heuristic Function of A-star[J]. International Journal of Advanced Robotic Systems, 2021, 2021(9): 17298814211042730. |
| [4] | 李琼琼, 徐溢琪, 布升强, 等. 基于修正PRM算法的智能车辆路径规划研究[J]. 森林工程, 2022, 38(5): 179-186. |
| Li Qiongqiong, Xu Yiqi, Bu Shengqiang, et al. Smart Vehicle Path Planning Based on Modified PRM Algorithm[J]. Forest Engineering, 2022, 38(5): 179-186. | |
| [5] | Katoch Sourabh, Sumit Singh Chauhan, Kumar Vijay. A Review on Genetic Algorithm: Past, Present, and Future[J]. Multimedia Tools and Applications, 2021, 80(5): 8091-8126. |
| [6] | 于力涵, 洪儒, 吴宇伦, 等. 基于IKGC-PSO算法的无人机三维路径规划系统[J]. 计算机测量与控制, 2023, 31(8): 259-266. |
| Yu Lihan, Hong Ru, Wu Yulun, et al. UAV 3D Path Planning System Based on IKGC-PSO Algorithm[J]. Computer Measurement & Control, 2023, 31(8): 259-266. | |
| [7] | Yuan Qingni, Yi Junhui, Sun Ruitong, et al. Path Planning of a Mechanical Arm Based on an Improved Artificial Potential Field and a Rapid Expansion Random Tree Hybrid Algorithm[J]. Algorithms, 2021, 14(11): 321. |
| [8] | 黄岩松, 姚锡凡, 景轩, 等. 基于深度Q网络的多起点多终点AGV路径规划[J]. 计算机集成制造系统, 2023, 29(8): 2550-2562. |
| Huang Yansong, Yao Xifan, Jing Xuan, et al. DQN-based AGV Path Planning for Situations with Multi-starts and Multi-targets[J]. Computer Integrated Manufacturing Systems, 2023, 29(8): 2550-2562. | |
| [9] | 周治国, 余思雨, 于家宝, 等. 面向无人艇的T-DQN智能避障算法研究[J]. 自动化学报, 2023, 49(8): 1645-1655. |
| Zhou Zhiguo, Yu Siyu, Yu Jiabao, et al. Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle[J]. Acta Automatica Sinica, 2023, 49(8): 1645-1655. | |
| [10] | Karaman S, Walter M R, Perez Alejandro, et al. Anytime Motion Planning Using the RRT*[C]//2011 IEEE International Conference on Robotics and Automation. Piscataway: IEEE, 2011: 1478-1483. |
| [11] | Kuffner J J, LaValle S M. RRT-connect: An Efficient Approach to Single-query Path Planning[C]//Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings. Piscataway: IEEE, 2000: 995-1001. |
| [12] | Karaman S, Frazzoli E. Sampling-based Algorithms for Optimal Motion Planning[J]. International Journal of Robotics Research, 2011, 30(7): 846-894. |
| [13] | Klemm Sebastian, Oberländer Jan, Hermann Andreas, et al. RRT*-connect: Faster, Asymptotically Optimal Motion Planning[C]//2015 IEEE International Conference on Robotics and Biomimetics (ROBIO). Piscataway: IEEE, 2015: 1670-1677. |
| [14] | 王冠强, 张驰洲, 陈明松, 等. 融合RRT-connect和DWA算法的室内移动机器人单目标点导航任务研究[J]. 中南大学学报(自然科学版), 2023, 54(11): 4326-4337. |
| Wang Guanqiang, Zhang Chizhou, Chen Mingsong, et al. Research on Single-target Point Navigation Task of Indoor Mobile Robot Integrating RRT-connect and DWA Algorithms[J]. Journal of Central South University(Science and Technology), 2023, 54(11): 4326-4337. | |
| [15] | Chiang H T L, Hsu J, Fiser M, et al. RL-RRT: Kinodynamic Motion Planning via Learning Reachability Estimators from RL Policies[J]. IEEE Robotics and Automation Letters, 2019, 4(4): 4298-4305. |
| [16] | Haarnoja T, Zhou A, Abbeel P, et al. Soft Actor-critic: Off-policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor[C]//Proceedings of the 35th International Conference on Machine Learning. Chia Laguna Resort: PMLR, 2018: 1861-1870. |
| [17] | Kurniawati H. Partially Observable Markov Decision Processes and Robotics[J]. Annual Review of Control, Robotics, and Autonomous Systems, 2022, 5: 253-277. |
| [18] | Konda V R, Tsitsiklis J N. Actor-citic Agorithms[C]//Proceedings of the 13th International Conference on Neural Information Processing Systems. Cambridge: MIT Press, 1999: 1008-1014. |
| [19] | 杨来义, 毕敬, 苑海涛. 基于SAC算法的移动机器人智能路径规划[J]. 系统仿真学报, 2023, 35(8): 1726-1736. |
| Yang Laiyi, Bi Jing, Yuan Haitao. Intelligent Path Planning for Mobile Robots Based on SAC Algorithm[J]. Journal of System Simulation, 2023, 35(8): 1726-1736. | |
| [20] | 罗征志, 韩怡可, 张鑫, 等. 改进RRT-connect与DWA算法的巡检机器人路径规划研究[J]. 计算机工程与应用, 2024, 60(15): 344-354. |
| Luo Zhengzhi, Han Yike, Zhang Xin, et al. Research on Path Planning of Inspection Robot with Improved RRT-connect and DWA Algorithm[J]. Computer Engineering and Applications, 2024, 60(15): 344-354. | |
| [21] | Prautzsch H, Boehm W, Paluszny M. Bézier and B-spline Techniques[M]. Berlin: Springer Science & Business Media, 2002. |
| [1] | 江明, 何韬. 基于深度强化学习的带容量约束车辆路径问题求解[J]. 系统仿真学报, 2025, 37(9): 2177-2187. |
| [2] | 江好胜, 武芳芳, 黄泽贤, 马子玥, 董春云, 平续斌. 动态障碍物环境下多四旋翼轨迹规划与跟踪[J]. 系统仿真学报, 2025, 37(8): 2089-2102. |
| [3] | 陈真, 吴卓屹, 张霖. 深度强化学习中策略表征研究简述[J]. 系统仿真学报, 2025, 37(7): 1753-1769. |
| [4] | 伍国华, 曾家恒, 王得志, 郑龙, 邹伟. 基于深度强化学习的四旋翼航迹跟踪控制方法[J]. 系统仿真学报, 2025, 37(5): 1169-1187. |
| [5] | 屈长虹, 王俊杰, 王坤, 崔清勇, 陈蒋洋, 王鑫鹏. 基于联合DQN的定向能系统火力智能决策建模仿真方法[J]. 系统仿真学报, 2025, 37(5): 1256-1265. |
| [6] | 张森, 代强强. 改进型深度确定性策略梯度的无人机路径规划[J]. 系统仿真学报, 2025, 37(4): 875-881. |
| [7] | 李敏, 张森, 曾祥光, 王刚, 张童伟, 谢地杰, 任文哲, 张滔. 基于深度强化学习的四足机器人单腿越障轨迹规划[J]. 系统仿真学报, 2025, 37(4): 895-909. |
| [8] | 王贺, 许佳宁, 闫广宇. 基于深度强化学习的AGV行人避让策略研究[J]. 系统仿真学报, 2025, 37(3): 595-606. |
| [9] | 杨超, 郑瑞群, 李圳, 张鸿薇, 唐燕群, 李东泽. 面向无人机辅助车联网的并行任务传输与处理优化策略[J]. 系统仿真学报, 2025, 37(3): 635-645. |
| [10] | 张斌, 雷永林, 李群, 高远, 陈永, 朱佳俊, 鲍琛龙. 基于强化学习的导弹突防决策建模研究[J]. 系统仿真学报, 2025, 37(3): 763-774. |
| [11] | 胡世军, 刘海亮, 王兵雷, 苏文科. 基于定向探索树算法的四旋翼无人机路径规划[J]. 系统仿真学报, 2025, 37(2): 311-324. |
| [12] | 黄思进, 文佳, 陈哲毅. 面向边缘车联网系统的智能服务迁移方法[J]. 系统仿真学报, 2025, 37(2): 379-391. |
| [13] | 费帅迪, 蔡长龙, 刘飞, 陈明晖, 刘晓明. 舰船防空反导的目标分配方法研究[J]. 系统仿真学报, 2025, 37(2): 508-516. |
| [14] | 徐忠锴, 储晨阳, 解凯, 赵睿卓, 柯文俊. 基于SC-PPO的高比例新能源电力系统优化调度方法[J]. 系统仿真学报, 2025, 37(10): 2511-2521. |
| [15] | 黄智钦, 卢恬英, 陈哲毅. 面向大规模IoT系统的多无人机部署与协作卸载[J]. 系统仿真学报, 2025, 37(1): 25-39. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||