Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (5): 1169-1187.doi: 10.16182/j.issn1004731x.joss.24-0025

• Papers • Previous Articles     Next Articles

A Quadrotor Trajectory Tracking Control Method Based on Deep Reinforcement Learning

Wu Guohua1, Zeng Jiaheng2, Wang Dezhi3, Zheng Long4, Zou Wei5   

  1. 1.School of Automation Central South University, Changsha 410083, China
    2.School of Traffic and Transportation Engineering, Central South University, Changsha 410083, China
    3.School of Meteorology and Oceanography, National University of Defense Technology, Changsha 410015, China
    4.Military Vocational Education Technology Service Center, National University of Defense Technology, Changsha 410015, China
    5.School of Computer Science and Engineering, Central South University, Changsha 410083, China
  • Received:2024-01-08 Revised:2024-03-12 Online:2025-05-20 Published:2025-05-23
  • Contact: Wang Dezhi

Abstract:

Traditional quadrotor controllers, constrained by fixed model equation structures, encounter challenges in addressing control errors stemming from variations in parameters and environmental disturbances. This paper proposes a deep reinforcement learning solution for the quadrotor trajectory following control problem. We present the PPO-SAG algorithm incorporated into the PPO framework, utilizing adaptive mechanisms and PID expert knowledge to enhance training convergence and stability. Target functions incorporating distance constraint penalties and entropy policies are designed in alignment with the characteristics of the given problem.We also devise innovative disturbance-adaptive structures and trajectory feature selection mechanisms to augment control error information and extract crucial elements from future trajectories, thereby enhancing convergence. Experiments on single and mixed trajectories indicate that the PPO-SAG algorithm achieves superior performance in both convergence and stability. Verification experiments confirm positive effects of proposed improvements. The trajectory tracking control problem of quadrotors based on deep reinforcement learning under unknown disturbances studied in this paper provides a solution for designing more robust and efficient quadrotor controllers.

Key words: deep reinforcement learning, track following control, proximal policy optimization(PPO), adaptive mechanism, attention mechanism

CLC Number: