Journal of System Simulation ›› 2026, Vol. 38 ›› Issue (2): 321-331.doi: 10.16182/j.issn1004731x.joss.25-0768

• Machine Learning Algorithms • Previous Articles    

Simulation of Robotic Arm Ball-catching Strategy Based on Curriculum RL of Transformer

Zhang Ziyao1, Ji Yunfeng2   

  1. 1.School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
    2.Institute of Machine Intelligence, University of Shanghai for Science and Technology, Shanghai 200093, China
  • Received:2025-08-12 Revised:2025-09-22 Online:2026-02-18 Published:2026-02-11
  • Contact: Ji Yunfeng

Abstract:

method integrating the PPO algorithm with Transformer network architecture is proposed, and curriculum learning strategy is introduced to solve the difficult training convergence and low efficiency of traditional RL methods in complex and dynamic high-degree-of-freedom tasks such as robotic arm ball catching. The Transformer is employed to effectively capture the complex high-dimensional dependency between the robotic arm's state space, ball trajectory, and environmental physical parameters. Curriculum learning progressively increases catching difficulty by designing training tasks from simple to complex objectives. The experimental results show this method increases the ball-catching success rate by over 60% compared to the traditional PPO and features excellent accuracy at tracking balls with real-world disturbance characteristics. This method not only enhances the performance and efficiency of dynamic catching for robotic arms in both simulated and real-world disturbance conditions, but also provides a novel solution for complex task control in real-world scenarios.

Key words: RL, curriculum learning, Transformer, robotic arm, ball-catching control

CLC Number: