系统仿真学报 ›› 2023, Vol. 35 ›› Issue (2): 386-395.doi: 10.16182/j.issn1004731x.joss.21-0835

• 论文 • 上一篇    下一篇

基于机器学习的地铁行人流在线优化控制研究

史佳洁1(), 杨鹏1(), 皮雁南2   

  1. 1.天津理工大学,天津 300000
    2.北京市地铁运营有限公司 地铁运营技术研发中心,北京 100082
  • 收稿日期:2021-08-18 修回日期:2021-11-15 出版日期:2023-02-28 发布日期:2023-02-16
  • 通讯作者: 杨鹏 E-mail:815425329@qq.com;29139475@qq.com
  • 作者简介:史佳洁(1997-),女,硕士生,研究方向为复杂系统仿真。E-mail:815425329@qq.com
  • 基金资助:
    中央高校基本科研业务费(2019JBM032)

Machine Learning-based Simulation Research of On-line Subway Pedestrian Flow Control

Jiajie Shi1(), Peng Yang1(), Yannan Pi2   

  1. 1.Tianjin University of Technology, Tianjin 300000, China
    2.Metro Operation Technology R&D Center, Beijing Metro Operation Co. , Ltd. , Beijing 100082, China
  • Received:2021-08-18 Revised:2021-11-15 Online:2023-02-28 Published:2023-02-16
  • Contact: Peng Yang E-mail:815425329@qq.com;29139475@qq.com

摘要:

为了实现高峰期地铁站行人流管控的在线优化,设计了基于机器学习的地铁站行人流管控算法框架。以某地铁车站早高峰的行人流管控流程为研究对象利用Agent技术搭建地铁站行人流管控仿真模型。多次运行仿真模型可以获得深度学习网络的训练数据。通过对网络进行充分训练,得到优化调度策略。将网络接入地铁站行人流实时运行数据,实现实时优化管控。仿真实验表明:引入的深度强化学习框架可以实现在线优化,调度结果优于传统方法。

关键词: 深度强化学习, 行人流管控, 地铁站调度策略, 在线仿真, 实时优化

Abstract:

For the online optimization of pedestrian flow control in subway station, an algorithm frame for pedestrian flow control in subway station based on machine learning is designed. The pedestrian flow control process of a subway station during morning rush hour is selected,and the agent-based model is built to simulate the control process. The training data is collected through the multiple runs of the model, which is used as the input of deep reinforcement learning network, and the mature net is obtained through adequate training to provide the optimizing scheduling policy. Linking the actual data with the mature net to realize the real-time schedule optimization of subway pedestrian flow control. Simulation experiments show that the framework of the deep reinforcement learning can realize the on-line optimization and the performance is better than traditional algorithm.

Key words: deep reinforcement learning, pedestrian flow control, scheduling strategy in subway station, online simulation, real-time optimization

中图分类号: