Exploring reward efficacy in traffic management using deep reinforcement learning in intelligent transportation system

Cited by: 10

Authors:

Paul, Ananya [1]

Mitra, Sulata [1]

Affiliations:

[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Shalimar, India

Source:

ETRI JOURNAL | 2022 / Vol. 44 / Issue 02

Keywords:

DRL; edge computing; ITS; PPO; traffic signal;

DOI:

10.4218/etrij.2021-0404

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

In the last decade, substantial progress has been achieved in intelligent traffic control technologies to overcome consistent difficulties of traffic congestion and its adverse effect on smart cities. Edge computing is one such advanced progress facilitating real-time data transmission among vehicles and roadside units to mitigate congestion. An edge computing-based deep reinforcement learning system is demonstrated in this study that appropriately designs a multiobjective reward function for optimizing different objectives. The system seeks to overcome the challenge of evaluating actions with a simple numerical reward. The selection of reward functions has a significant impact on agents' ability to acquire the ideal behavior for managing multiple traffic signals in a large-scale road network. To ascertain effective reward functions, the agent is trained using the proximal policy optimization method in several deep neural network models, including the state-of-the-art transformer network. The system is verified using both hypothetical scenarios and real-world traffic maps. The comprehensive simulation outcomes demonstrate the potency of the suggested reward functions.

References

Pages: 194-207

Page count: 14

9 entries in total

[1] [Anonymous], 2018, ARXIV PREPRINT ARXIV
[2] Egea AC, 2020, IEEE SYS MAN CYBERN, P965, DOI 10.1109/SMC42975.2020.9283498
[3] Cooperative Control for Multi-Intersection Traffic Signal Based on Deep Reinforcement Learning and Imitation Learning
Huo, Yusen
Tao, Qinghua
Hu, Jianming
[J]. IEEE ACCESS, 2020, 8 : 199573 - 199585
[4] Adaptive traffic signal control system using composite reward architecture based deep reinforcement learning
Jamil, Abu Rafe Md
Ganguly, Kishan Kumar
Nower, Naushin
[J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (14) : 2030 - 2041
[5] Paul Ananya, 2021, ISMSI 2021: 2021 5th International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence, P60, DOI 10.1145/3461598.3461608
[6] Deep Reinforcement Learning based Traffic Signal Optimization for Multiple Intersections in ITS
Paul, Ananya
Mitra, Sulata
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATIONS SYSTEMS (IEEE ANTS), 2020,
[7] Adaptive Traffic Signal Control : Exploring Reward Definition For Reinforcement Learning
Touhbi, Saad
Babram, Mohamed Ait
Tri Nguyen-Huu
Marilleau, Nicolas
Hbid, Moulay L.
Cambier, Christophe
Stinckwich, Serge
[J]. 8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 513 - 520
[8] IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control
Wei, Hua
Zheng, Guanjie
Yao, Huaxiu
Li, Zhenhui
[J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2496 - 2505
[9] Traffic Signal Control Using Deep Reinforcement Learning with Multiple Resources of Rewards
Zhong, Dunhao
Boukerche, Azzedine
[J]. PE-WASUN'19: PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYMPOSIUM ON PERFORMANCE EVALUATION OF WIRELESS AD HOC, SENSOR, & UBIQUITOUS NETWORKS, 2019, : 23 - 28
