基于机器学习的地铁行人流在线优化控制研究

doi:10.16182/j.issn1004731x.joss.21-0835

系统仿真学报 ›› 2023, Vol. 35 ›› Issue (2): 386-395.doi: 10.16182/j.issn1004731x.joss.21-0835

基于机器学习的地铁行人流在线优化控制研究

史佳洁¹(), 杨鹏¹(), 皮雁南²

^1.天津理工大学，天津 300000
^2.北京市地铁运营有限公司地铁运营技术研发中心，北京 100082

收稿日期:2021-08-18 修回日期:2021-11-15 出版日期:2023-02-28 发布日期:2023-02-16
通讯作者: 杨鹏 E-mail:815425329@qq.com;29139475@qq.com
作者简介:史佳洁(1997-)，女，硕士生，研究方向为复杂系统仿真。E-mail：815425329@qq.com
基金资助:
中央高校基本科研业务费(2019JBM032)

Machine Learning-based Simulation Research of On-line Subway Pedestrian Flow Control

Jiajie Shi¹(), Peng Yang¹(), Yannan Pi²

^1.Tianjin University of Technology, Tianjin 300000, China
^2.Metro Operation Technology R&D Center, Beijing Metro Operation Co. , Ltd. , Beijing 100082, China

Received:2021-08-18 Revised:2021-11-15 Online:2023-02-28 Published:2023-02-16
Contact: Peng Yang E-mail:815425329@qq.com;29139475@qq.com

摘要/Abstract

摘要：

为了实现高峰期地铁站行人流管控的在线优化，设计了基于机器学习的地铁站行人流管控算法框架。以某地铁车站早高峰的行人流管控流程为研究对象，利用Agent技术搭建地铁站行人流管控仿真模型。多次运行仿真模型可以获得深度学习网络的训练数据。通过对网络进行充分训练，得到优化调度策略。将网络接入地铁站行人流实时运行数据，实现实时优化管控。仿真实验表明：引入的深度强化学习框架可以实现在线优化，调度结果优于传统方法。

关键词: 深度强化学习, 行人流管控, 地铁站调度策略, 在线仿真, 实时优化

Abstract:

For the online optimization of pedestrian flow control in subway station, an algorithm frame for pedestrian flow control in subway station based on machine learning is designed. The pedestrian flow control process of a subway station during morning rush hour is selected,and the agent-based model is built to simulate the control process. The training data is collected through the multiple runs of the model, which is used as the input of deep reinforcement learning network, and the mature net is obtained through adequate training to provide the optimizing scheduling policy. Linking the actual data with the mature net to realize the real-time schedule optimization of subway pedestrian flow control. Simulation experiments show that the framework of the deep reinforcement learning can realize the on-line optimization and the performance is better than traditional algorithm.

Key words: deep reinforcement learning, pedestrian flow control, scheduling strategy in subway station, online simulation, real-time optimization

中图分类号:

TP391.9

史佳洁, 杨鹏, 皮雁南. 基于机器学习的地铁行人流在线优化控制研究[J]. 系统仿真学报, 2023, 35(2): 386-395.

Jiajie Shi, Peng Yang, Yannan Pi. Machine Learning-based Simulation Research of On-line Subway Pedestrian Flow Control[J]. Journal of System Simulation, 2023, 35(2): 386-395.

图/表 18

图1

图2

图3

表1

表2

表3

表4

各因素权重取值

权重系数	通过一致性检验的权重
$α 1$	0.1
$α 2$	0.3
$α 3$	0.2
$α 4$	0.4

表4

图4

图5

图6

表5

表6

表7

表8

图7

图8

图9

图10

参考文献 14

1	王媛媛. 基于行人动力学的地铁车站客流仿真模型[J].电子测试, 2018(12): 49-50, 48.
	Wang Yuanyuan. Passenger Flow Simulation Model for Metro Station Based on Pedestrian Dynamics[J]. Electrical Test, 2018(12): 49-50, 48.
2	李建华, 陈伟, 陈祥儒. 基于AnyLogic的人流增多地铁站系统再优化[J]. 科学技术与工程, 2020, 20(33): 13847-13851.
	Li Jianhua, Chen Wei, Chen Xiangru. Simulation and Optimization of Metro Station System Based on AnyLogic[J]. Science Technology and Engineering, 2020, 20(33): 13847-13851.
3	鲁晨, 孙健, 杨涛. 综合交通枢纽客流行人换乘组织优化及仿真[J]. 系统工程, 2020, 38(3): 84-91.
	Lu Chen, Sun Jian, Yang Tao. Optimum Design and Simulation of Pedestrian Passenger Transfer Operation in Integrated Transportation Hub[J]. Systems Engineering, 2020, 38(3): 84-91.
4	杨天阳, 朱志国. 基于AnyLogic的地铁车站通道设施设备规模与布局分析[J]. 交通运输工程与信息学报, 2017, 15(1): 115-121.
	Yang Tianyang, Zhu Zhiguo. Scale and Layout Analyses of Subway Station Facilities Based on AnyLogic Software[J]. Journal of Transportation Engineering and Information, 2017, 15(1): 115-121.
5	Caliendo Ciro, Ciambelli Paolo, Rossella Del Regno, et al. Modelling and Numerical Simulation of Pedestrian Flow Evacuation from a Multi-Storey Historical Building in the Event of Fire Applying Safety Engineering Tools[J]. Journal of Cultural Heritage(S1296-2074), 2020, 41: 188-199.
6	龙顺忠, 王志建, 刘皓. 网联环境下基于深度强化学习的单路口交通信号控制优化[J]. 工业控制计算机, 2020, 33(10): 16-19, 22.
	Long Shunzhong, Wang Zhijian, Liu Hao. Based on Deep Reinforcement Learning Optimization of Traffic Signal Control at Single Intersection[J]. Industrial Control Computer, 2020, 33(10): 16-19, 22.
7	Huang Yicai, Chen Jiayuan, Duan Qiannan, et al. A Fast Antibiotic Detection Method for Simplified Pretreatment Through Spectra-Based Machine Learning[J]. Frontiers of Environmental Science & Engineering(S2095-2201), 2022, 16(3): 135-146.
8	Abdelaali Hassaine, Salimi Khorshidi Gholamreza, Dexter Canoy, et al. Untangling the Complexity of Multimorbidity with Machine Learning[J]. Mechanisms of Ageing and Development(S0047-6374), 2020, 190: 111325.
9	张琪, 曾俊杰, 许凯, 等. 基于机器学习的计算机生成兵力行为建模研究综述[J]. 系统仿真学报, 2021, 33(2): 280-287.
	Zhang Qi, Zeng Junjie, Xu Kai, et al. Behavior Modeling for Computer Gederated Forces Based on Machine Learning[J]. Journal of System Simulation, 2021, 33(2): 280-287.
10	张伟罡, 龚希武. 基于机器学习的工业炉智能控制与实现[J]. 工业炉, 2020, 42(3): 45-49.
	Zhang Weigang, Gong Xiwu. Intelligent Control and Implementation of Industrial Furnace Based on Machine Learning[J]. Industrial Furnace, 2020, 42(3): 45-49.
11	李增有. 基于机器学习的雷达目标跟踪算法研究[J]. 信息技术与信息化, 2020(8): 217-219.
	Li Zengyou. Research on Radar Target Tracking Algorithm Based on Machine Learning[J]. Information Technology and Informatization, 2020(8): 217-219.
12	张荣霞, 武长旭, 孙同超, 等. 深度强化学习及在路径规划中的研究进展[J].计算机工程与应用, 2021, 57(19): 44-56.
	Zhang Rongxia, Wu Changxu, Sun Tongchao, et al. Progress on Deep Reinforcement Learning in Path Planning[J]. Computer Engineering and Applications, 2021, 57(19): 44-56.
13	Wang Haonan, Liu Ning, Zhang Yiyun, et al. Deep Reinforcement Learning: a Survey[J]. Frontiers of Information Technology & Electronic Engineering(S2095-9184), 2020, 21(12): 1726-1744.
14	Neeraj R R, Nithin R P, Niranjhan P, et al. Modelling and Simulation of Discrete Manufacturing Industry[J]. Materials Today: Proceedings(S2214-7853), 2018, 5(11): 24971-24983.

时段	该时段首发时间	时间间隔
07:00—08:00	07:01	每4 min一趟
08:00—09:00	08:00	每3 min一趟
09:00—10:00	09:01	每3 min一趟

参数	初值
安检通道数量/个	16
人工安检数量/个	4
闸机数量/个	84
隔离栏长度/m	20
电梯数量/个	24

参数	初值
安检速度/(个/m)	Triangular(2.5, 5.5, 2.5)
行人行走速度/(m/s)	Triangular(0.8, 1.6, 0.8)

输入	变量取值
当前时刻观测变量	站外限行区行人流密度
	安检区行人流密度
	闸机通行行人流密度
	站台行人流密度
上一时刻控制变量	安检通道数量
	人工安检通道数量
	闸机数量
	电动步梯数量

输出	变量取值
下一时刻控制变量	安检通道数量
	人工安检通道数量
	闸机数量
	电动步梯数量

基于机器学习的地铁行人流在线优化控制研究

Machine Learning-based Simulation Research of On-line Subway Pedestrian Flow Control

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

图/表 18

参考文献 14

相关文章 6

编辑推荐

Metrics

本文评价

优化参数	最小值	最大值	步长	初值
安检通道数量	1	16	1	8
闸机开放数量	1	84	1	50
电动步梯数量	0	24	1	10

[1]	张森, 张孟炎, 邵敬平, 普杰信. 基于随机策略搜索的多机三维路径规划方法[J]. 系统仿真学报, 2022, 34(6): 1286-1295.
[2]	倪凌佳, 黄晓霞, 李红旮, 张子博. 基于协作式深度强化学习的火灾应急疏散仿真研究[J]. 系统仿真学报, 2022, 34(6): 1353-1366.
[3]	王红微, 杨鹏. 基于深度强化学习的机场货运业务优化研究[J]. 系统仿真学报, 2022, 34(3): 651-660.
[4]	李启锐, 彭心怡. 基于深度强化学习的云作业调度及仿真研究[J]. 系统仿真学报, 2022, 34(2): 258-268.
[5]	高昂, 董志明, 张国辉, 梁涛, 郭齐胜. LVC训练系统中计算机生成兵力生成技术研究[J]. 系统仿真学报, 2021, 33(3): 745-752.
[6]	曾贲, 房霄, 孔德帅, 宋祥祥, 贾政轩, 林廷宇. 一种数据驱动的对抗博弈智能体建模方法[J]. 系统仿真学报, 2021, 33(12): 2838-2845.