Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (4): 875-881. doi: 10.16182/j.issn1004731x.joss.23-1524

• Papers

UAV Path Planning Based on Improved Deep Deterministic Policy Gradients

Zhang Sen, Dai Qiangqiang   

  1. College of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China
  • Received: 2023-12-13 Revised: 2024-01-17 Online: 2025-04-17 Published: 2025-04-16

Abstract:

To address the poor convergence and ineffective exploration that UAVs exhibit when planning paths in complex environments, an improved deep deterministic policy gradient (DDPG) algorithm is proposed. A dual experience pool mechanism stores successful and failed experiences separately, so that the algorithm can use successful experiences to reinforce policy optimization and learn from failed experiences to avoid wrong paths. An artificial potential field (APF) method is introduced to add a guidance term to the planning; this term is combined with noisy exploratory actions through a random sampling process to dynamically integrate the selected actions. A composite reward function built from direction, distance, obstacle avoidance, and time rewards achieves multi-objective optimization of the path and addresses the reward sparsity problem. Experiments show that the proposed algorithm significantly improves the reward and success rate and converges in a shorter time.
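The abstract names three mechanisms without giving their formulas, so the following Python sketch is only one plausible reading, not the authors' implementation. It assumes a 2D workspace, a fixed mixing ratio between the two experience pools, the textbook attractive/repulsive APF force, a probabilistic switch between the APF action and a Gaussian-noise policy action, and a weighted sum of four reward terms. All names, weights, and defaults (`DualReplayBuffer`, `success_ratio`, `p_apf`, `safe_dist`, and so on) are hypothetical.

```python
import math
import random
from collections import deque


class DualReplayBuffer:
    """Two pools: transitions from successful episodes and from failed ones."""

    def __init__(self, capacity=10000, success_ratio=0.5):
        self.success = deque(maxlen=capacity)
        self.failure = deque(maxlen=capacity)
        self.success_ratio = success_ratio  # assumed mixing ratio, not from the paper

    def add_episode(self, transitions, succeeded):
        # Route the whole episode to one pool according to its outcome.
        (self.success if succeeded else self.failure).extend(transitions)

    def sample(self, batch_size, rng=random):
        # Split the batch between pools; top up from the success pool
        # when the failure pool cannot supply its share.
        n_succ = min(len(self.success), round(batch_size * self.success_ratio))
        n_fail = min(len(self.failure), batch_size - n_succ)
        n_succ = min(len(self.success), batch_size - n_fail)
        return (rng.sample(list(self.success), n_succ)
                + rng.sample(list(self.failure), n_fail))


def apf_direction(pos, goal, obstacles, influence=5.0, k_att=1.0, k_rep=10.0):
    """Unit vector of the classic attractive + repulsive APF force in 2D."""
    fx = k_att * (goal[0] - pos[0])
    fy = k_att * (goal[1] - pos[1])
    for ox, oy in obstacles:
        dx, dy = pos[0] - ox, pos[1] - oy
        d = math.hypot(dx, dy)
        if 0.0 < d < influence:
            # Repulsive magnitude k_rep * (1/d - 1/d0) / d^2, directed away from the obstacle.
            mag = k_rep * (1.0 / d - 1.0 / influence) / (d * d)
            fx += mag * dx / d
            fy += mag * dy / d
    norm = math.hypot(fx, fy)
    return (fx / norm, fy / norm) if norm > 0 else (0.0, 0.0)


def select_action(policy_action, apf_action, p_apf=0.3, noise_std=0.1, rng=random):
    """With probability p_apf follow the APF guidance, else the noisy policy action."""
    if rng.random() < p_apf:
        return tuple(apf_action)
    # Gaussian exploration noise, clipped to the assumed action range [-1, 1].
    return tuple(max(-1.0, min(1.0, a + rng.gauss(0.0, noise_std)))
                 for a in policy_action)


def combined_reward(prev_pos, pos, goal, min_obstacle_dist,
                    safe_dist=2.0, weights=(0.5, 1.0, 1.0, 0.01)):
    """Weighted sum of direction, distance, obstacle-avoidance, and time terms."""
    move = (pos[0] - prev_pos[0], pos[1] - prev_pos[1])
    to_goal = (goal[0] - prev_pos[0], goal[1] - prev_pos[1])
    move_len, goal_len = math.hypot(*move), math.hypot(*to_goal)
    # Direction: cosine between the step taken and the bearing to the goal.
    r_dir = 0.0
    if move_len > 0 and goal_len > 0:
        r_dir = (move[0] * to_goal[0] + move[1] * to_goal[1]) / (move_len * goal_len)
    # Distance: progress made toward the goal during this step.
    r_dist = goal_len - math.hypot(goal[0] - pos[0], goal[1] - pos[1])
    # Obstacle avoidance: linear penalty inside the safety radius.
    r_obs = 0.0
    if min_obstacle_dist < safe_dist:
        r_obs = -(safe_dist - min_obstacle_dist) / safe_dist
    # Time: constant per-step cost that favors shorter paths.
    r_time = -1.0
    w_dir, w_dist, w_obs, w_time = weights
    return w_dir * r_dir + w_dist * r_dist + w_obs * r_obs + w_time * r_time
```

Because the dense direction and distance terms give feedback at every step, a reward of this shape is one common way to ease reward sparsity. The actual terms, weights, and action-integration rule should be taken from the full paper.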

Key words: UAV, DRL, path planning, deep deterministic policy gradient (DDPG), APF
