系统仿真学报 ›› 2026, Vol. 38 ›› Issue (2): 518-531.doi: 10.16182/j.issn1004731x.joss.25-0921

• 应急管理应用 • 上一篇    

知识增强大语言模型的区域交通信号控制方法

胥日升1, 杨林瑶2, 覃缘琪2, 王晓3, 孙长银4   

  1. 1.安徽大学 人工智能学院,安徽 合肥 230601
    2.之江实验室,浙江 杭州 311121
    3.光电信息获取与防护技术全国重点实验室,安徽 合肥 230031
    4.安徽大学,安徽 合肥 230601
  • 收稿日期:2025-09-23 修回日期:2025-12-25 出版日期:2026-02-18 发布日期:2026-02-11
  • 通讯作者: 王晓
  • 第一作者简介:胥日升(2001-),男,硕士生,研究方向为社会认知计算及其在自主无人系统中的应用。
  • 基金资助:
    国家自然科学基金(62173329)

Knowledge-enhanced LLM-based Method for Regional Traffic Signal Control

Xu Risheng1, Yang Linyao2, Qin Yuanqi2, Wang Xiao3, Sun Changyin4   

  1. 1.School of Artificial Intelligence, Anhui University, Hefei 230601, China
    2.Zhijiang Laboratory, Hangzhou 311121, China
    3.National Key Laboratory of Optoelectronic Information Acquisition and Protection Technology, Hefei 230031, China
    4.Anhui University, Hefei 230601, China
  • Received:2025-09-23 Revised:2025-12-25 Online:2026-02-18 Published:2026-02-11
  • Contact: Wang Xiao

摘要:

自适应交通信号控制是缓解区域交通拥堵的关键,但其在突发事件实时响应与全局协同方面仍面临严峻挑战。为解决DRL方法依赖纯数据驱动,存在泛化性差、可解释性弱、缺乏突发事件处置知识指导难以满足复杂交通场景需求的问题,提出一种结合知识驱动和数据优化的控制系统。通过GraphRAG构建动态交通知识图谱,为LLM提供实时更新的历史突发事件处置经验与路网拓扑知识;采用LoRA微调技术,在避免LLM全量训练高成本的同时,精准注入交通信号控制领域专业知识;设计结构化提示词拼接区域路网状态与相关上下文,使LLM具备区域协同与事件响应能力。仿真实验表明:相比传统及强化学习基线最优方法,该方法平均通行时间分别降低9.85%、4.87%和17.4%,平均等待时间在济南与Random场景中分别降低4.17%和6.76%,吞吐量分别提升0.95%、3.71%和8.53%,且在Random场景下性能提升尤为显著。

关键词: 交通信号控制, 知识图谱, 图检索增强生成, 微调, 大语言模型, 强化学习

Abstract:

Adaptive traffic signal control (ATSC) is crucial for alleviating regional traffic congestion, yet it faces severe challenges in real-time response to unexpected events and global coordination. The DRL method relies on pure data-driven approaches, suffering from core limitations such as poor generalization, weak interpretability, and a lack of guidance from emergency disposal knowledge, which makes them difficult to meet the demands of complex traffic scenarios. A control system that integrates knowledge-driven and data-optimized approaches was proposed. The GraphRAG was used to construct a dynamic traffic knowledge graph, providing LLMs with real-time updated historical emergency disposal experience and road network topology knowledge. The LoRA fine-tuning technologywas adopted to accurately inject professional knowledge in the field of traffic signal control while avoiding the high cost of full-scale LLM training. A structured Prompt was also designed to concatenate regional road network states and relevant contexts, enabling LLMs to possess regional coordination and event response capabilities. Simulation experiments show that compared with traditional methods and the optimal reinforcement learning baseline method, this approach reduces the average travel time by 9.85%, 4.87%, and 17.4% respectively, shortens the average waiting time by 4.17% (Ji′nan dataset) and 6.76% (Random scenario) respectively, and increases the throughput by 0.95%, 3.71%, and 8.53% respectively. In addition, the performance improvement is particularly significant in the Random scenario.

Key words: traffic signal control, knowledge graph, graph retrieval-augmented generation, fine-tuning, LLMs, RL

中图分类号: