Coordinated Control of Distributed Traffic Signal Based on Multiagent Cooperative Game

Cited by: 5

Authors:

Zhang, Zhenghua [1]

Qian, Jin [1]

Fang, Chongxin [1]

Liu, Guoshu [2]

Su, Quan [2]

Affiliations:

[1] Yangzhou Univ, Coll Informat Engn, Yangzhou, Jiangsu, Peoples R China

[2] Yangzhou Guomai Commun Dev Co LTD, Yangzhou, Jiangsu, Peoples R China

Source:

WIRELESS COMMUNICATIONS & MOBILE COMPUTING | 2021 / Vol. 2021

Keywords:

NETWORK;

DOI:

10.1155/2021/6693636

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In adaptive traffic signal control (ATSC), reinforcement learning (RL) is an active research frontier, and combining it with deep neural networks further enhances its learning ability. Distributed multiagent RL (MARL) can avoid this kind of problem by having each local RL agent observe only part of a complex planar traffic area. However, because communication between agents is limited, the environment becomes only partially observable to each agent. This paper proposes multiagent reinforcement learning based on a cooperative game (CG-MARL), in which each intersection is modeled as an agent. The method considers not only communication and coordination between agents but also the game between them. Each agent observes its own area to learn an RL policy and value function; a hybrid network then combines the Q functions of the different agents to form the final Q function over the entire large-scale transportation network. The results show that the proposed method outperforms the traditional control method.
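The abstract's pipeline (local observation, per-agent Q function, combination of the agents' Q values into one network-wide value) can be illustrated with a minimal sketch. The abstract does not specify the form of the hybrid network, so everything below is an assumption: the linear local Q functions, the non-negative weighted sum used for mixing (borrowed from monotonic value-mixing approaches), and all names such as `local_q_values` and `mix_q_values` are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only; not the paper's CG-MARL implementation.

def local_q_values(observation, weights):
    """Hypothetical per-agent Q function: one linear score per signal phase."""
    return [sum(w * x for w, x in zip(row, observation)) for row in weights]


def greedy_actions(observations, agent_weights):
    """Each intersection picks its best phase from its own local observation."""
    actions, chosen_qs = [], []
    for obs, weights in zip(observations, agent_weights):
        qs = local_q_values(obs, weights)
        best = max(range(len(qs)), key=qs.__getitem__)
        actions.append(best)
        chosen_qs.append(qs[best])
    return actions, chosen_qs


def mix_q_values(agent_qs, mixing_weights, bias=0.0):
    """Assumed mixing step: a weighted sum with non-negative weights.

    Taking abs() of each weight keeps the joint value non-decreasing in
    every agent's Q value, so improving one agent never lowers the total.
    """
    return sum(abs(w) * q for w, q in zip(mixing_weights, agent_qs)) + bias


# Two intersections, each observing queue lengths on two approaches.
observations = [[3.0, 1.0], [0.5, 2.0]]
# Made-up weights: phase k simply serves approach k.
agent_weights = [[[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]]]

actions, chosen_qs = greedy_actions(observations, agent_weights)
q_total = mix_q_values(chosen_qs, mixing_weights=[0.5, -1.5])
print(actions, chosen_qs, q_total)
```

In this toy setup each agent acts greedily on its own Q values, and the mixing step only aggregates them; in a learned system the mixing weights would be trained rather than fixed by hand.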


Pages: 13
