Urban Traffic Control in Software Defined Internet of Things via a Multi-Agent Deep Reinforcement Learning Approach

被引：159

作者：

Yang, Jiachen ^{[1
]}

Zhang, Jipeng ^{[1
]}

Wang, Huihui ^{[2
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Jacksonville Univ, Dept Engn, Jacksonville, FL 32211 USA

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2021年 / 22卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Machine learning; Feature extraction; Switches; Software; Internet of Things; Protocols; Urban traffic control; software defined internet of things; multi-agent deep reinforcement learning; modified proximal policy optimization; SIGNAL CONTROL; SYSTEM; LEVEL;

D O I：

10.1109/TITS.2020.3023788

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

As the growth of vehicles and the acceleration of urbanization, the urban traffic congestion problem becomes a burning issue in our society. Constructing a software defined Internet of things(SD-IoT) with a proper traffic control scheme is a promising solution for this issue. However, existing traffic control schemes do not make the best of the advances of the multi-agent deep reinforcement learning area. Furthermore, existing traffic congestion solutions based on deep reinforcement learning(DRL) only focus on controlling the signal of traffic lights, while ignore controlling vehicles to cooperate traffic lights. So the effect of urban traffic control is not comprehensive enough. In this article, we propose Modified Proximal Policy Optimization (Modified PPO) algorithm. This algorithm is ideally suited as the traffic control scheme of SD-IoT. We adaptively adjust the clip hyperparameter to limit the bound of the distance between the next policy and the current policy. What's more, based on the collected data of SD-IoT, the proposed algorithm controls traffic lights and vehicles in a global view to advance the performance of urban traffic control. Experimental results under different vehicle numbers show that the proposed method is more competitive and stable than the original algorithm. Our proposed method improves the performance of SD-IoT to relieve traffic congestion.

引用

页码：3742 / 3754

页数：13

共 59 条

[21] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[22]

Ku Ian, 2014, 2014 13th Annual Mediterranean Ad Hoc Networking Workshop (MED-HOC-NET), P103, DOI 10.1109/MedHocNet.2014.6849111

[23]

Lantz B., P 9 ACM SIGCOMM WORK, P1

[24] CornerNet: Detecting Objects as Paired Keypoints [J].

Law, Hei ;

Deng, Jia .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03) :642-656

[25] AliMe Assist: An Intelligent Assistant for Creating an Innovative E-commerce Experience [J].

Li, Feng-Lin ;

Qiu, Minghui ;

Chen, Haiqing ;

Wang, Xiongwei ;

Gao, Xing ;

Huang, Jun ;

Ren, Juwei ;

Zhao, Zhongzhou ;

Zhao, Weipeng ;

Wang, Lei ;

Jin, Guwei ;

Chu, Wei .

CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, :2495-2498

[26] Traffic signal timing via deep reinforcement learning [J].

Li L. ;

Lv Y. ;

Wang F.-Y. .

IEEE/CAA Journal of Automatica Sinica, 2016, 3 (03) :247-254

[27]

Lillicrap T. P., 2015, PREPRINT

[28]

Lowe R, 2017, ADV NEUR IN, V30

[29] OpenFlow: Enabling innovation in campus networks [J].

McKeown, Nick ;

Anderson, Tom ;

Balakrishnan, Hari ;

Parulkar, Guru ;

Peterson, Larry ;

Rexford, Jennifer ;

Shenker, Scott ;

Turner, Jonathan .

ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2008, 38 (02) :69-74

[30]

Misbahuddin S, 2015, 2015 12TH INTERNATIONAL CONFERENCE ON HIGH-CAPACITY OPTICAL NETWORKS AND ENABLING/EMERGING TECHNOLOGIES (HONET), P142

← 1 2 3 4 5 6 →