EGLight: enhancing deep reinforcement learning with expert guidance for traffic signal control

被引：0

作者：

Zhang, Meng ^{[1
,2
]}

Wang, Dianhai ^{[1
,3
]}

Cai, Zhengyi ^{[4
]}

Huang, Yulang ^{[1
]}

Yu, Hongxin ^{[1
]}

Qin, Hanwu ^{[1
]}

Zeng, Jiaqi ^{[1
,3
]}

机构：

[1] Zhejiang Univ, Coll Civil Engn & Architecture, Hangzhou 310058, Peoples R China

[2] Zhejiang Urban Governance Studies Ctr, Hangzhou, Peoples R China

[3] Zhejiang Univ, Zhongyuan Inst, Zhengzhou 450001, Peoples R China

[4] Hangzhou City Univ, Sch Informat & Elect Engn, Hangzhou, Peoples R China

来源：

TRANSPORTMETRICA A-TRANSPORT SCIENCE | 2025年

基金：

中国国家自然科学基金;

关键词：

Traffic signal control; deep reinforcement learning; supervised learning; expert policy; LIGHT CONTROL; GRADIENT;

D O I：

10.1080/23249935.2025.2486263

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Deep Reinforcement Learning (DRL) is prevalent in traffic signal control. However, the training process often encounters slow learning rate and unstable convergence due to limited state representation and exploratory learning. Inspired by human learning, we incorporate expert guidance in the exploration process to accelerate convergence and enhance performance. The proposed framework, termed Expert-Guided Light (EGLight), contains three moudles. The state perception module combines statistical features with cellular features to enhance model robustness. The decision-making module employs expert-guided learning to promote the learning efficiency. In the learning module, four distinct loss functions are employed to make full use of the interaction experience and update the agent's policy. Extensive tests demonstrate EGLight's superior convergence speed and effectiveness over traditional methods. The analysis shows that the precise feature design is helpful for the agent and proper expert guidance is crucial for the convergence of agent learning process.

引用

页数：27

共 52 条

[1] Reinforcement learning: Introduction to theory and potential for transport applications
Abdulhai, B
Kattan, L
[J]. CANADIAN JOURNAL OF CIVIL ENGINEERING, 2003, 30 (06) : 981 - 991
[2] Reinforcement learning for True Adaptive traffic signal control
Abdulhai, B
Pringle, R
Karakoulas, GJ
[J]. JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) : 278 - 285
[3] Deep Reinforcement Learning A brief survey
Arulkumaran, Kai
Deisenroth, Marc Peter
Brundage, Miles
Bharath, Anil Anthony
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
[4] A MARKOVIAN DECISION PROCESS
BELLMAN, R
[J]. JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05): : 679 - 684
[5] Deep reinforcement learning for traffic signal control with consistent state and reward design approach
Bouktif, Salah
Cheniki, Abderraouf
Ouni, Ali
El-Sayed, Hesham
[J]. KNOWLEDGE-BASED SYSTEMS, 2023, 267
[6] Chen Hanyang., 2024, Advances in Neural Information Processing Systems, V37, P123353
[7] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
Chu, Tianshu
Wang, Jie
Codeca, Lara
Li, Zhaojian
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
[8] Cools Seung-Bae., 2013, Springer Advances in applied selforganizing systems, P45
[9] Genders W., 2016, arXiv
[10] Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1

← 1 2 3 4 5 6 →