Journal of System Simulation ›› 2019, Vol. 31 ›› Issue (10): 2155-2163.doi: 10.16182/j.issn1004731x.joss.18-0820

Previous Articles     Next Articles

Dynamic Inventory Routing Optimization Based on Deep Reinforcement Learning

Zhou Jianpin1, Zhang Shuliu2   

  1. 1. School of Navigation, Jimei University, Xiamen 361021, China;
    2. Jilin Power Supply Company of State Grid, Jilin 132000, China
  • Received:2018-12-10 Revised:2019-02-15 Online:2019-10-10 Published:2019-12-12

Abstract: Aiming at the dynamic stochastic inventory routing problem with periodic fluctuation of demand, a novel simulation optimization approach based on deep reinforcement learning is proposed to achieving periodic steady strategy. Firstly a dynamic combinatorial optimization model is constructed. Then, by deep reinforcement learning and setting heuristic rules, the replenishment nodes set selection and the replenishment batch allocation weights in each period are determined. The simulation experimental results show that the proposed method can improve the average profit of a cycle by about 2.7% and 3.9% in low fluctuating demand case and by about 8.2% and 7.1% in high fluctuating demand case compared with the two solution methods in the existing literature, and the cycle service level can be stabilized within a small fluctuation range under different demand fluctuation environments.

Key words: inventory routing problem, heuristic rules, deep Q-learning, dynamics, periodic steady strategy

CLC Number: