Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (8): 2016-2029.doi: 10.16182/j.issn1004731x.joss.0124-0243

• Papers • Previous Articles    

Optimization of Product Oil Distribution with Multiple Trips and Multiple Due Dates under Dynamic Demand

Xie Yong1, Gao Hailong1, Chen Yutao2, Wang Huanjiang1   

  1. 1.School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
    2.School of Power Engineering, Naval University of Engineering, Wuhan 430030, China
  • Received:2024-03-14 Revised:2024-06-21 Online:2025-08-20 Published:2025-08-26
  • Contact: Chen Yutao

Abstract:

In the case of dynamic demand, considering the order due date, vehicle transportation time window, and other factors, this paper developed an optimization model of periodic product oil distribution with multiple trips and multiple due dates to maximize the distribution revenue. The paper also designed a reinforcement learning-based large neighborhood search algorithm to solve the problem. The initial solution was constructed based on the forward insertion heuristic algorithm. Then, a deep reinforcement learning model for neighborhood operator selection was designed. By fitting the action value function through the double deep Q network, the optimal neighborhood operator was selected, and the optimal distribution scheme was obtained. The experimental results show that the large neighborhood search algorithm based on reinforcement learning proposed in this paper can effectively improve the solving speed while ensuring the solution's quality.

Key words: product oil distribution, dynamic demand, multiple trips, multiple due dates, RL

CLC Number: