Multi-Agent Task Assignment in Vehicular Edge Computing: A Regret-Matching Learning-Based Approach

Citations: 4
Authors
Nguyen, Bach Long [1]
Nguyen, Duong D. [2]
Nguyen, Hung X. [2]
Ngo, Duy T. [3]
Wagner, Markus [1]
Affiliations
[1] Monash Univ, Dept Data Sci & AI, Clayton, Vic 3800, Australia
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
[3] Univ Newcastle, Sch Engn, Callaghan, NSW 2308, Australia
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024 / Vol. 8 / Issue 02
Funding
Australian Research Council;
Keywords
Correlated equilibrium; intelligent transportation systems; multi-agent learning; regret matching; task assignment; vehicular edge computing; REINFORCEMENT; EQUILIBRIUM; COMPLEXITY; MIGRATION; VEHICLES;
DOI
10.1109/TETCI.2023.3339540
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Vehicular edge computing has emerged as a solution for enabling computation-intensive applications within Intelligent Transportation Systems (ITS), encompassing domains like autonomous driving and augmented reality. Despite notable progress in this domain, the efficient allocation of constrained computational resources to a spectrum of time-critical ITS tasks remains a substantial challenge. We address this challenge by devising an innovative task assignment scheme tailored for vehicles navigating a highway. Given the high speed of vehicles and the limited communication radius of roadside units (RSUs), the dynamic migration of computation tasks among multiple servers becomes imperative. We present a novel approach that formulates the task assignment challenge as a binary nonlinear programming (BNLP) problem, managing the allocation of computation tasks from vehicles to RSUs and a macrocell base station. To tackle the potentially large dimensionality of this optimization problem, we develop a distributed multi-agent regret-matching learning algorithm. Incorporating the method of regret minimization, our proposed algorithm employs a forgetting mechanism that enables a continuous learning process, thereby accommodating the high mobility of vehicle networks. We prove that this algorithm converges towards correlated equilibrium solutions for our BNLP formulation. Extensive simulations, grounded in practical parameter settings, underscore the algorithm's ability to minimize total delay and task processing costs, while ensuring equitable utility distribution among agents.
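The abstract describes regret-matching learning with a forgetting mechanism but does not give the update rule. The sketch below is therefore not the authors' algorithm: it is textbook unconditional regret matching with an assumed exponential forgetting factor. The class name `RegretMatchingAgent`, the `forgetting` parameter, and the toy utility values are hypothetical and chosen only for illustration. Each agent plays an action with probability proportional to its positive accumulated regret, and older regrets are discounted so that recent rounds weigh more, which is one simple way to keep learning when the environment changes.

```python
import random


class RegretMatchingAgent:
    """Illustrative unconditional regret matching with exponential forgetting.

    Assumption: this is a generic sketch, not the update rule from the paper.
    """

    def __init__(self, num_actions, forgetting=0.9, seed=None):
        self.num_actions = num_actions
        # Hypothetical discount in (0, 1]; 1.0 means no forgetting.
        self.forgetting = forgetting
        self.regrets = [0.0] * num_actions
        self.rng = random.Random(seed)

    def strategy(self):
        """Mixed strategy proportional to positive regrets (uniform if none)."""
        positive = [max(r, 0.0) for r in self.regrets]
        total = sum(positive)
        if total <= 0.0:
            return [1.0 / self.num_actions] * self.num_actions
        return [p / total for p in positive]

    def act(self):
        """Sample an action index from the current mixed strategy."""
        return self.rng.choices(range(self.num_actions), weights=self.strategy())[0]

    def update(self, chosen, utilities):
        """Discount old regrets, then add this round's regret for each action.

        utilities[a] is the utility the agent would have received had it
        played action a while the other agents kept their actions.
        """
        realized = utilities[chosen]
        for a in range(self.num_actions):
            instant_regret = utilities[a] - realized
            self.regrets[a] = self.forgetting * self.regrets[a] + instant_regret


# Toy usage: one agent choosing between two servers with made-up utilities.
agent = RegretMatchingAgent(num_actions=2, forgetting=0.5, seed=0)
agent.update(chosen=0, utilities=[1.0, 3.0])  # server 1 would have been better
next_action = agent.act()
```

In a multi-agent setting, each vehicle would run its own instance and compute `utilities` from the actions the other agents took in that round. Hart and Mas-Colell's conditional variant of regret matching is the one classically associated with convergence to correlated equilibria; the unconditional form shown here is only the simpler relative.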
Pages: 1527-1539
Number of pages: 13