Stealthy Black-Box Attack With Dynamic Threshold Against MARL-Based Traffic Signal Control System

被引：4

作者：

Ren, Yan ^{[1
]}

Zhang, Heng ^{[1
,2
]}

Du, Linkang ^{[3
]}

Zhang, Zhikun ^{[4
]}

Zhang, Jian ^{[2
]}

Li, Hongran ^{[2
]}

机构：

[1] Jiangsu Ocean Univ, Coll Elect Engn, Lianyungang 222000, Peoples R China

[2] Jiangsu Ocean Univ, Coll Comp Engn, Lianyungang 222000, Peoples R China

[3] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310000, Peoples R China

[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2024年 / 20卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Training; Perturbation methods; Heuristic algorithms; Closed box; Optimization; Control systems; Vehicle dynamics; Adversarial attack; deep reinforcement learning (DRL); defense; security; traffic signal control; ROBUSTNESS;

D O I：

10.1109/TII.2024.3413356

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multiagent reinforcement learning (MARL) promises outstanding performance for multiintersection traffic signal control systems (TSCS), enabling intelligent administration of cities. However, the vulnerability of MARL algorithms to adversarial attacks has raised concerns about the security of TSCS. In this article, we explore the robustness of MARL-based TSCS against adversarial attacks, propose a black-box multiobject attack strategy, and assign an attack budget to ensure stealthiness. We design a dynamic threshold-based selection of critical states to minimize the cumulative reward with a limited number of attacks. In addition, we present a lightweight agnostic dynamic threshold-based defense mechanism by enhancing the worst-case performance of the policy. We formulate it as a min-max optimization problem, i.e., minimizing the quantity of training sample alterations while maximizing the cumulative discount reward of policy against the perturbed states. Extensive experiments on simulation of urban mobility (SUMO) demonstrate that the proposed attack policy can significantly reduce the performance of TSCS.

引用

页码：12021 / 12031

页数：11

共 40 条

[1]

Alegre L. N., 2019, US

[2] Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events [J].

Aslani, Mohammad ;

Mesgari, Mohammad Saadi ;

Wiering, Marco .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2017, 85 :732-752

[3]

Behzadan V., 2017, ARXIV

[4] Towards Evaluating the Robustness of Neural Networks [J].

Carlini, Nicholas ;

Wagner, David .

2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57

[5] Exposing Congestion Attack on Emerging Connected Vehicle based Traffic Signal Control [J].

Chen, Qi Alfred ;

Yin, Yucheng ;

Feng, Yiheng ;

Mao, Z. Morley ;

Liu, Henry X. .

25TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2018), 2018,

[6]

Croce F, 2020, PR MACH LEARN RES, V119

[7] Vulnerability of Traffic Control System Under Cyberattacks with Falsified Data [J].

Feng, Yiheng ;

Huang, Shihong ;

Chen, Qi Alfred ;

Liu, Henry X. ;

Mao, Z. Morley .

TRANSPORTATION RESEARCH RECORD, 2018, 2672 (01) :1-11

[8]

Figura M, 2021, P AMER CONTR CONF, P3050, DOI 10.23919/ACC50511.2021.9483080

[9] Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning [J].

Guo, Jun ;

Chen, Yonghong ;

Hao, Yihang ;

Yin, Zixin ;

Yu, Yin ;

Li, Simin .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :114-121

[10] Adversarial Attacks and Defense in Deep Reinforcement Learning (DRL)-Based Traffic Signal Controllers [J].

Haydari, Ammar ;

Zhang, Michael ;

Chuah, Chen-Nee .

IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 2 :402-416

← 1 2 3 4 →