Multi-Agent Task Assignment in Vehicular Edge Computing: A Regret-Matching Learning-Based Approach

被引：4

作者：

Nguyen, Bach Long ^{[1
]}

Nguyen, Duong D. ^{[2
]}

Nguyen, Hung X. ^{[2
]}

Ngo, Duy T. ^{[3
]}

Wagner, Markus ^{[1
]}

机构：

[1] Monash Univ, Dept Data Sci & AI, Clayton, Vic 3800, Australia

[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia

[3] Univ Newcastle, Sch Engn, Callaghan, NSW 2308, Australia

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024年 / 8卷 / 02期

基金：

澳大利亚研究理事会;

关键词：

Correlated equilibrium; intelligent transportation systems; multi-agent learning; regret matching; task assignment; vehicular edge computing; REINFORCEMENT; EQUILIBRIUM; COMPLEXITY; MIGRATION; VEHICLES;

D O I：

10.1109/TETCI.2023.3339540

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Vehicular edge computing has emerged as a solution for enabling computation-intensive applications within Intelligent Transportation Systems (ITS), encompassing domains like autonomous driving and augmented reality. Despite notable progress in this domain, the efficient allocation of constrained computational resources to a spectrum of time-critical ITS tasks remains a substantial challenge. We address this challenge by devising an innovative task assignment scheme tailored for vehicles navigating a highway. Given the high speed of vehicles and the limited communication radius of roadside units (RSUs), the dynamic migration of computation tasks among multiple servers becomes imperative. We present a novel approach that formulates the task assignment challenge as a binary nonlinear programming (BNLP) problem, managing the allocation of computation tasks from vehicles to RSUs and a macrocell base station. To tackle the potentially large dimensionality of this optimization problem, we develop a distributed multi-agent regret-matching learning algorithm. Incorporating the method of regret minimization, our proposed algorithm employs a forgetting mechanism that enables a continuous learning process, thereby accommodating the high mobility of vehicle networks. We prove that this algorithm converges towards correlated equilibrium solutions for our BNLP formulation. Extensive simulations, grounded in practical parameter settings, underscore the algorithm's ability to minimize total delay and task processing costs, while ensuring equitable utility distribution among agents.

引用

页码：1527 / 1539

页数：13

共 41 条

[11]

Flaspohler Genevieve E., 2021, INT C MACHINE LEARNI, P3363

[12] IoT-Enabled Autonomous System Collaboration for Disaster-Area Management [J].

Girma, Abenezer ;

Bahadori, Niloofar ;

Sarkar, Mrinmoy ;

Tadewos, Tadewos G. ;

Behnia, Mohammad R. ;

Mahmoud, M. Nabil ;

Karimoddini, Ali ;

Homaifar, Abdollah .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (05) :1249-1262

[13] EXISTENCE OF CORRELATED EQUILIBRIA [J].

HART, S ;

SCHMEIDLER, D .

MATHEMATICS OF OPERATIONS RESEARCH, 1989, 14 (01) :18-25

[14] A simple adaptive procedure leading to correlated equilibrium [J].

Hart, S ;

Mas-Colell, A .

ECONOMETRICA, 2000, 68 (05) :1127-1150

[15]

Hart S., 2001, Economics Essays, P181, DOI [10.1007/978-3-662-04623-4_12, DOI 10.1007/978-3-662-04623-4_12]

[16] The query complexity of correlated equilibria [J].

Hart, Sergiu ;

Nisan, Noam .

GAMES AND ECONOMIC BEHAVIOR, 2018, 108 :401-410

[17] How long to equilibrium? The communication complexity of uncoupled equilibrium procedures [J].

Hart, Sergiu ;

Mansour, Yishay .

GAMES AND ECONOMIC BEHAVIOR, 2010, 69 (01) :107-126

[18]

Jain R., 1984, Tech. Rep. TR- 301, P1

[19] Joint Secure Offloading and Resource Allocation for Vehicular Edge Computing Network: A Multi-Agent Deep Reinforcement Learning Approach [J].

Ju, Ying ;

Chen, Yuchao ;

Cao, Zhiwei ;

Liu, Lei ;

Pei, Qingqi ;

Xiao, Ming ;

Ota, Kaoru ;

Dong, Mianxiong ;

Leung, Victor C. M. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) :5555-5569

[20]

Krishnamurthy Vikram., 2016, Partially observed markov decision processes

← 1 2 3 4 5 →