EGLight: enhancing deep reinforcement learning with expert guidance for traffic signal control

被引:0
作者
Zhang, Meng [1 ,2 ]
Wang, Dianhai [1 ,3 ]
Cai, Zhengyi [4 ]
Huang, Yulang [1 ]
Yu, Hongxin [1 ]
Qin, Hanwu [1 ]
Zeng, Jiaqi [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Civil Engn & Architecture, Hangzhou 310058, Peoples R China
[2] Zhejiang Urban Governance Studies Ctr, Hangzhou, Peoples R China
[3] Zhejiang Univ, Zhongyuan Inst, Zhengzhou 450001, Peoples R China
[4] Hangzhou City Univ, Sch Informat & Elect Engn, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Traffic signal control; deep reinforcement learning; supervised learning; expert policy; LIGHT CONTROL; GRADIENT;
D O I
10.1080/23249935.2025.2486263
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Deep Reinforcement Learning (DRL) is prevalent in traffic signal control. However, the training process often encounters slow learning rate and unstable convergence due to limited state representation and exploratory learning. Inspired by human learning, we incorporate expert guidance in the exploration process to accelerate convergence and enhance performance. The proposed framework, termed Expert-Guided Light (EGLight), contains three moudles. The state perception module combines statistical features with cellular features to enhance model robustness. The decision-making module employs expert-guided learning to promote the learning efficiency. In the learning module, four distinct loss functions are employed to make full use of the interaction experience and update the agent's policy. Extensive tests demonstrate EGLight's superior convergence speed and effectiveness over traditional methods. The analysis shows that the precise feature design is helpful for the agent and proper expert guidance is crucial for the convergence of agent learning process.
引用
收藏
页数:27
相关论文
共 52 条
  • [1] Reinforcement learning: Introduction to theory and potential for transport applications
    Abdulhai, B
    Kattan, L
    [J]. CANADIAN JOURNAL OF CIVIL ENGINEERING, 2003, 30 (06) : 981 - 991
  • [2] Reinforcement learning for True Adaptive traffic signal control
    Abdulhai, B
    Pringle, R
    Karakoulas, GJ
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) : 278 - 285
  • [3] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [4] A MARKOVIAN DECISION PROCESS
    BELLMAN, R
    [J]. JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05): : 679 - 684
  • [5] Deep reinforcement learning for traffic signal control with consistent state and reward design approach
    Bouktif, Salah
    Cheniki, Abderraouf
    Ouni, Ali
    El-Sayed, Hesham
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 267
  • [6] Chen Hanyang., 2024, Advances in Neural Information Processing Systems, V37, P123353
  • [7] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [8] Cools Seung-Bae., 2013, Springer Advances in applied selforganizing systems, P45
  • [9] Genders W., 2016, arXiv
  • [10] Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1