Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (3): 584-594.doi: 10.16182/j.issn1004731x.joss.24-0098

• Papers • Previous Articles    

An Intelligent Ambulance Regulation Model Based on Online Reinforcement Learning Algorithm

Zhang Lei1,2, Zhang Xuechao2, Wang Chao2, Bo Xianglei3   

  1. 1.Joint Operations College, National Defence University, Beijing 100000, China
    2.Joint Logistics College, National Defence University, Beijing 100858, China
    3.Automobile NCO Academy, Army Military Transportation University of PLA, Bengbu 233011, China
  • Received:2024-01-24 Revised:2024-02-05 Online:2025-03-17 Published:2025-03-21

Abstract:

In emergency scenarios where ambulances are used to evacuate casualties, it is necessary to fully coordinate the rescue capability of the ambulance with the real-time status of the casualties in the scenario to achieve the best rescue results. Such problems are generally non-deterministic polynomial problems, and the traditional deterministic scheduling algorithms are less effective. This paper aimed at the modeling research of the real-time regulation of ambulances in emergency scenarios, an online reinforcement learning DNQ algorithm frameworks based on the data enhancement method is proposed and applied to the solution of the ambulances control model. To solve the problems of poor repeatability in emergency scenarios and slow training of the agent due to the low accumulation speed of learning samples, a DA-DQN method combining the data augmentation method on the basis of the traditional DQN algorithm is proposed. Simulation results show that several classical DQN methods can be trained online to obtain agent, achieving better scheduling results than deterministic algorithms. The treatment failure rate achieved by classical "first-come, first-served" algorithm scheduling is about 45.4%, while the medical failure rate after DQN agent convergence is about 25%.The agent training speed of DA-DQN method is much faster than that of traditional DQN method.Moreover, it has practical application potential in emergency rescue operation regulation.

Key words: emergency scenario, ambulance evacuation, online reinforcement learning, data augmentation, action control optimization

CLC Number: