Collaborative Task Offloading Optimization for Satellite Mobile Edge Computing Using Multi-Agent Deep Reinforcement Learning

被引:12
作者
Zhang, Hangyu [1 ]
Zhao, Hongbo [2 ]
Liu, Rongke [1 ,2 ]
Kaushik, Aryan [3 ]
Gao, Xiangqiang [1 ,4 ]
Xu, Shenzhan
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Shenzhen Inst, Shenzhen 518063, Peoples R China
[3] Univ Sussex, Sch Engn & Informat, Brighton BN1 9RH, E Sussex, England
[4] China Acad Space Technol Xian, Xian 710100, Peoples R China
基金
北京市自然科学基金;
关键词
Satellites; Task analysis; Satellite broadcasting; Optimization; Low earth orbit satellites; Computational modeling; Resource management; Satellite mobile edge computing; distributed cooperative computing; computation offloading; resource allocation; multi-agent deep reinforcement learning; CONNECTIVITY; INTEGRATION; CHALLENGES; NETWORKS; TRENDS; LINKS;
D O I
10.1109/TVT.2024.3405642
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Satellite mobile edge computing (SMEC) achieves efficient processing for space missions by deploying computing servers on low Earth orbit (LEO) satellites, which supplements a strong computing service for future satellite-terrestrial integrated networks. However, considering the spatio-temporal constraints on large-scale LEO networks, inter-satellite cooperative computing is still challenging. In this paper, a multi-agent collaborative task offloading scheme for distributed SMEC is proposed. Facing the time-varying available satellites and service requirements, each autonomous satellite agent dynamically adjusts offloading decisions and resource allocations based on local observations. Furthermore, for evaluating the behavioral contribution of an agent to task completion, we adopt a deep reinforcement learning algorithm based on counterfactual multi-agent policy gradients (COMA) to optimize the strategy, which enables energy-efficient decisions satisfying the time and resource restrictions of SMEC. An actor-critic (AC) framework is effectively exploited to separately implement centralized training and distributed execution (CTDE) of the algorithm. We also redesign the actor structure by introducing an attention-based bidirectional long short-term memory network (Atten-BiLSTM) to explore the temporal characteristics of LEO networks. The simulation results show that the proposed scheme can effectively enable satellite autonomous collaborative computing in the distributed SMEC environment, and outperforms the benchmark algorithms.
引用
收藏
页码:15483 / 15498
页数:16
相关论文
共 43 条
[1]   Improving time series forecasting using LSTM and attention models [J].
Abbasimehr, Hossein ;
Paki, Reza .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (01) :673-691
[2]   Applying autonomy to distributed satellite systems: Trends, challenges, and future prospects [J].
Araguz, Carles ;
Bou-Balust, Elisenda ;
Alarcon, Eduard .
SYSTEMS ENGINEERING, 2018, 21 (05) :401-416
[3]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[4]   A Review on Early Forest Fire Detection Systems Using Optical Remote Sensing [J].
Barmpoutis, Panagiotis ;
Papaioannou, Periklis ;
Dimitropoulos, Kosmas ;
Grammalidis, Nikos .
SENSORS, 2020, 20 (22) :1-26
[5]   Satellite Networking Integration in the 5G Ecosystem: Research Trends and Open Challenges [J].
Boero, Luca ;
Bruschi, Roberto ;
Davoli, Franco ;
Marchese, Mario ;
Patrone, Fabio .
IEEE NETWORK, 2018, 32 (05) :9-15
[6]   Integration of Satellite and LTE for Disaster Recovery [J].
Casoni, Maurizio ;
Grazia, Carlo Augusto ;
Klapez, Martin ;
Patriciello, Natale ;
Amditis, A. ;
Sdongos, E. .
IEEE COMMUNICATIONS MAGAZINE, 2015, 53 (03) :47-53
[7]  
Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
[8]   Collaborative Multi-Resource Allocation in Terrestrial-Satellite Network Towards 6G [J].
Fu, Shu ;
Gao, Jie ;
Zhao, Lian .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) :7057-7071
[9]  
Gao X., 2022, P IEEE 1 INT C 6G NE, P1
[10]   Dynamic Resource Allocation for Virtual Network Function Placement in Satellite Edge Clouds [J].
Gao, Xiangqiang ;
Liu, Rongke ;
Kaushik, Aryan ;
Zhang, Hangyu .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (04) :2252-2265