系统仿真学报 ›› 2022, Vol. 34 ›› Issue (6): 1286-1295.doi: 10.16182/j.issn1004731x.joss.21-0112

• 仿真建模理论与方法 • 上一篇    下一篇

基于随机策略搜索的多机三维路径规划方法

张森(), 张孟炎, 邵敬平, 普杰信   

  1. 河南科技大学 信息工程学院,河南 洛阳 471023
  • 收稿日期:2021-02-05 修回日期:2021-06-16 出版日期:2022-06-30 发布日期:2022-06-16
  • 作者简介:张森(1984-),男,博士,副教授,研究方向为智能机器人控制,水下光场建模与仿真,智能图像处理。 E-mail:zhangsen_hust@163.com

Multi-UAVs 3D Path Planning Method Based on Random Strategy Search

Sen Zhang(), Mengyan Zhang, Jingping Shao, Jiexin Pu   

  1. College of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China
  • Received:2021-02-05 Revised:2021-06-16 Online:2022-06-30 Published:2022-06-16

摘要:

针对传统无能耗约束的多无人机路径规划方法难以适应复杂山地作业环境的应急救援要求,提出了一种基于LSTM-DPPO(long short-term memory-distributed proximal policy optimization)框架的多无人机三维路径规划算法。利用LSTM长短期记忆神经网络提取出多无人机在各自飞行过程中的重要特征状态信息序列,经过多次迭代更新后得到一个最优网络参数模型,结合能耗生成最优的三维探测路径。实验结果表明:该方法相对于传统路径规划方法效果显著,能在能耗最小的前提下规划出最优探测路径。

关键词: 多无人机, 深度强化学习算法, 神经网络, 三维路径规划, 能耗

Abstract:

In view of the difficulty of the traditional path planning method without energy consumption constraints to meet the emergency rescue requirements in the complex mountain operation environment, a three-dimensional path planning algorithm for multi-UAVs is proposed based on LSTM-DPPO(long short-term memory-distributed proximal policy optimization) framework. The LSTM long and short-term memory neural network is used to extract the important characteristic state information sequence of the multiple unmanned aerial vehicles in their respective flight process. After repeated iteration and updating, an optimal network parameter model is obtained. Combined with the energy consumption, the optimal 3D detection path is generated. Simulation experiments verify that the proposed method is more effective than the traditional path planning method and can plan the optimal detection path with the minimum energy consumption.

Key words: multi-UAVs, deep reinforcement learning algorithms, neural network, 3D path planning, energy consumption

中图分类号: