Exploring reward efficacy in traffic management using deep reinforcement learning in intelligent transportation system

被引:10
作者
Paul, Ananya [1 ]
Mitra, Sulata [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Shalimar, India
关键词
DRL; edge computing; ITS; PPO; traffic signal;
D O I
10.4218/etrij.2021-0404
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the last decade, substantial progress has been achieved in intelligent traffic control technologies to overcome consistent difficulties of traffic congestion and its adverse effect on smart cities. Edge computing is one such advanced progress facilitating real-time data transmission among vehicles and roadside units to mitigate congestion. An edge computing-based deep reinforcement learning system is demonstrated in this study that appropriately designs a multiobjective reward function for optimizing different objectives. The system seeks to overcome the challenge of evaluating actions with a simple numerical reward. The selection of reward functions has a significant impact on agents' ability to acquire the ideal behavior for managing multiple traffic signals in a large-scale road network. To ascertain effective reward functions, the agent is trained withusing the proximal policy optimization method in several deep neural network models, including the state-of-the-art transformer network. The system is verified using both hypothetical scenarios and real-world traffic maps. The comprehensive simulation outcomes demonstrate the potency of the suggested reward functions.
引用
收藏
页码:194 / 207
页数:14
相关论文
共 9 条
  • [1] [Anonymous], 2018, ARXIV PREPRINT ARXIV
  • [2] Egea AC, 2020, IEEE SYS MAN CYBERN, P965, DOI [10.1109/smc42975.2020.9283498, 10.1109/SMC42975.2020.9283498]
  • [3] Cooperative Control for Multi-Intersection Traffic Signal Based on Deep Reinforcement Learning and Imitation Learning
    Huo, Yusen
    Tao, Qinghua
    Hu, Jianming
    [J]. IEEE ACCESS, 2020, 8 : 199573 - 199585
  • [4] Adaptive traffic signal control system using composite reward architecture based deep reinforcement learning
    Jamil, Abu Rafe Md
    Ganguly, Kishan Kumar
    Nower, Naushin
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (14) : 2030 - 2041
  • [5] Paul Ananya, 2021, ISMSI 2021: 2021 5th International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence, P60, DOI 10.1145/3461598.3461608
  • [6] Deep Reinforcement Learning based Traffic Signal Optimization for Multiple Intersections in ITS
    Paul, Ananya
    Mitra, Sulata
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATIONS SYSTEMS (IEEE ANTS), 2020,
  • [7] Adaptive Traffic Signal Control : Exploring Reward Definition For Reinforcement Learning
    Touhbi, Saad
    Babram, Mohamed Ait
    Tri Nguyen-Huu
    Marilleau, Nicolas
    Hbid, Moulay L.
    Cambier, Christophe
    Stinckwich, Serge
    [J]. 8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 513 - 520
  • [8] IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control
    Wei, Hua
    Zheng, Guanjie
    Yao, Huaxiu
    Li, Zhenhui
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2496 - 2505
  • [9] Traffic Signal Control Using Deep Reinforcement Learning with Multiple Resources of Rewards
    Zhong, Dunhao
    Boukerche, Azzedine
    [J]. PE-WASUN'19: PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYMPOSIUM ON PERFORMANCE EVALUATION OF WIRELESS AD HOC, SENSOR, & UBIQUITOUS NETWORKS, 2019, : 23 - 28