Hybrid CNN-LSTM and Proximal Policy Optimization Model for Traffic Light Control in a Multi-Agent Environment

被引：0

作者：

Faqir, Nada ^{[1
]}

Ennaji, Yassine ^{[1
]}

Chakir, Loqman ^{[1
]}

Boumhidi, Jaouad ^{[1
]}

机构：

[1] Sidi Mohamed Ben Abdellah Univ, Fac Sci Dhar El Mahraz, LISAC Lab, Fes 30003, Morocco

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Adaptive traffic control; CNN-LSTM; multi-agent reinforcement learning; proximal policy optimization; smart cities; spatio-temporal prediction; SUMO; traffic flow optimization; traffic light control; urban traffic management; NETWORK;

D O I：

10.1109/ACCESS.2025.3541042

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Conventional traffic light control systems often exhibit rigid timing patterns, limited flexibility, and insufficient adaptability to changing traffic conditions. This paper addresses urban traffic management challenges, addressing researchers and professionals in traffic engineering and intelligent transportation systems. To overcome these challenges, this paper presents an innovative traffic light control method, integrating a CNN-LSTM model for traffic state prediction, combined with a Proximal Policy Optimization (PPO) algorithm for traffic light control decision-making. The model is based on a representation of intersection states through key indicators (such as active green phase, congestion levels, congestion variations, and vehicle speeds) and employs deep reinforcement learning to optimize traffic light control strategies. The adopted method is compared with fixed traffic light control approaches and a reinforcement learning (Q-learning) approach in a simulated environment using SUMO. The simulations consider diverse traffic scenarios and realistic urban conditions to ensure robust evaluation. The experimental results indicate that traffic efficiency is significantly improved by up to 92% in scenarios managing medium traffic demand, with up to 2000 vehicles per hour in the North-South scenario, while congestion indicators are substantially reduced. These improvements are achieved under conditions where traffic remains below the saturation threshold, ensuring stable flow management.The proposed method demonstrates its effectiveness in optimizing signalized intersections, significantly enhancing traffic flow in pre-saturation conditions, while opening perspectives for further research in oversaturated networks. These results illustrate the power of integrating spatiotemporal prediction and PPO-based control for dynamic and adaptive traffic management in urban networks.

引用

页码：29577 / 29588

页数：12

共 34 条

[1] Reinforcement learning: Introduction to theory and potential for transport applications [J].

Abdulhai, B ;

Kattan, L .

CANADIAN JOURNAL OF CIVIL ENGINEERING, 2003, 30 (06) :981-991

[2]

[Anonymous], 2012, Int. J Adv. Syst. Meas.

[3] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control [J].

Bazzan, Ana L. C. .

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) :342-375

[4]

Chen RX, 2022, Arxiv, DOI arXiv:2206.11996

[5]

De Oliveira D., 2006, P CEUR WORKSH, P31

[6] SUMO's Lane-Changing Model [J].

Erdmann, Jakob .

MODELING MOBILITY WITH OPEN DATA, 2015, :105-123

[7]

Feng X., 2024, P ACM INT C AUT AG M, P873

[8] Large-Scale Traffic Signal Control Based on Integration of Adaptive Subgraph Reformulation and Multi-agent Deep Reinforcement Learning [J].

Gong, Kai ;

Sun, Qiwei ;

Zhong, Xiaofang ;

Zhang, Yanhua .

ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 :751-762

[9]

Jiang QZ, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P3854

[10] Road Traffic Prediction Model Using Extreme Learning Machine: The Case Study of Tangier, Morocco [J].

Jiber, Mouna ;

Mbarek, Abdelilah ;

Yahyaouy, Ali ;

Sabri, My Abdelouahed ;

Boumhidi, Jaouad .

INFORMATION, 2020, 11 (12) :1-15

← 1 2 3 4 →