Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach

被引：1

作者：

He, Xingqiu ^{[1
,2
]}

You, Chaoqun ^{[1
,2
]}

Quek, Tony Q. S. ^{[3
,4
]}

机构：

[1] Fudan Univ, Intelligent Networking & Comp Res Ctr, Shanghai 200437, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai 200437, Peoples R China

[3] Singapore Univ Technol & Design, Singapore 487372, Singapore

[4] Yonsei Univ, Yonsei Frontier Lab, Seoul 03722, South Korea

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 10期

基金：

新加坡国家研究基金会;

关键词：

Task analysis; Heuristic algorithms; System dynamics; Measurement; Data processing; Minimization; Servers; Age of information; mobile edge computing; post-decision state; deep reinforcement learning; RESOURCE-ALLOCATION; STATUS UPDATE; PEAK AGE; INFORMATION; COMPUTATION; OPTIMIZATION; NETWORKS; MANAGEMENT; TRADEOFF; QUEUE;

D O I：

10.1109/TMC.2024.3370101

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rapid development of Mobile Edge Computing (MEC), various real-time applications have been deployed to benefit people's daily lives. The performance of these applications relies heavily on the freshness of collected environmental information, which can be quantified by its Age of Information (AoI). In the traditional definition of AoI, it is assumed that the status information can be actively sampled and directly used. However, for many MEC-enabled applications, the desired status information is updated in an event-driven manner and necessitates data processing. To better serve these applications, we propose a new definition of AoI and, based on the redefined AoI, we formulate an online AoI minimization problem for MEC systems. Notably, the problem can be interpreted as a Markov Decision Process (MDP), thus enabling its solution through Reinforcement Learning (RL) algorithms. Nevertheless, the traditional RL algorithms are designed for MDPs with completely unknown system dynamics and hence usually suffer long convergence times. To accelerate the learning process, we introduce Post-Decision States (PDSs) to exploit the partial knowledge of the system's dynamics. We also combine PDSs with deep RL to further improve the algorithm's applicability, scalability, and robustness. Numerical results demonstrate that our algorithm outperforms the benchmarks under various scenarios.

引用

页码：9881 / 9897

页数：17

共 70 条

[1] Finding the Exact Distribution of (Peak) Age of Information for Queues of PH/PH/1/1 and M/PH/1/2 Type
Akar, Nail
Dogan, Ozancan
Atay, Eray Unsal
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (09) : 5661 - 5672
[2] Altman Eitan, 1999, Constrained Markov Decision Processes, V7
[3] Becvar P.MachandZ., 2017, IEEE Commun. Surveys Tut., V19, P1656
[4] Bertsekas D. P., 2000, Dynamic Programming and Optimal Control, V2nd
[5] Champati JP, 2018, IEEE CONF COMPUT, P130
[6] LOW-POWER CMOS DIGITAL DESIGN
CHANDRAKASAN, AP
SHENG, S
BRODERSEN, RW
[J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1992, 27 (04) : 473 - 484
[7] Joint Optimization of Sensing and Computation for Status Update in Mobile Edge Computing Systems
Chen, Yi
Chang, Zheng
Min, Geyong
Mao, Shiwen
Hamalainen, Timo
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (11) : 8230 - 8243
[8] On the Age of Information in Status Update Systems With Packet Management
Costa, Maice
Codreanu, Marian
Ephremides, Anthony
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (04) : 1897 - 1910
[9] Costa M, 2014, IEEE INT SYMP INFO, P1583, DOI 10.1109/ISIT.2014.6875100
[10] Smart anomaly detection in sensor systems: A multi-perspective review
Erhan, L.
Ndubuaku, M.
Di Mauro, M.
Song, W.
Chen, M.
Fortino, G.
Bagdasar, O.
Liotta, A.
[J]. INFORMATION FUSION, 2021, 67 : 64 - 79

← 1 2 3 4 5 6 7 →