Efficient Resource Allocation for Multi-Beam Satellite-Terrestrial Vehicular Networks: A Multi-Agent Actor-Critic Method With Attention Mechanism

被引:41
作者
He, Ying [1 ]
Wang, Yuhang [1 ]
Yu, F. Richard [2 ]
Lin, Qiuzhen [1 ]
Li, Jianqiang [1 ]
Leung, Victor C. M. [3 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518000, Peoples R China
[2] Carleton Univ, Sch Informat Technol, Ottawa, ON K1S 5B6, Canada
[3] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
基金
中国国家自然科学基金;
关键词
Satellites; Bandwidth; Resource management; Reinforcement learning; Training; Task analysis; Quality of service; Multi-beam satellites; satellite-terrestrial networks; multi-agent reinforcement learning; resources allocation; attention mechanism; INTEGRATED NETWORK; REINFORCEMENT;
D O I
10.1109/TITS.2021.3128209
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
With the rapid development of intelligent transportation systems, there is an increasing demand for a variety of vehicular services, such as automated driving assistance, emergency alert, infotainment, etc. However, in some situations (e.g., remote areas or maritime scenarios), the terrestrial networks alone cannot serve the vehicular applications very well due to the infrastructure deployment and maintenance issues. Satellite networks have become an effective supplement to terrestrial networks, which complement well in terms of coverage, flexibility, reliability, and availability. In this paper, we consider the low orbit multi-beam satellite-terrestrial networks to serve for vehicles. We model this problem as a cooperative multi-agent reinforcement learning process, where each beam acts as an agent, and the global bandwidth is cooperatively shared among all the agents. A multi-agent actor-critic method with attention mechanism is proposed to allocate resources for vehicles with strict delay requirements and minimum bandwidth consumption. When allocating bandwidth, the channel efficiency, the angle of the beams and the priorities of requests in different regions are also considered. Centralized training and distributed execution is performed in the training of the agents. Extensive simulation results verify the effectiveness of our proposed method, where all the agents can well cooperative to achieve efficient resource allocation on-demand for the vehicles under strictly limited bandwidth resources.
引用
收藏
页码:2727 / 2738
页数:12
相关论文
共 45 条
[1]  
Busoniu L, 2010, STUD COMPUT INTELL, V310, P183
[2]   Satellite Communications Supporting Internet of Remote Things [J].
De Sanctis, Mauro ;
Cianca, Ernestina ;
Araniti, Giuseppe ;
Bisio, Igor ;
Prasad, Ramjee .
IEEE INTERNET OF THINGS JOURNAL, 2016, 3 (01) :113-123
[3]  
Deutschmann Jorg, 2020, Measurement, Modelling and Evaluation of Computing Systems. 20th International GI/ITG Conference, MMB 2020. Proceedings. Lecture Notes in Computer Science (LNCS 12040), P65, DOI 10.1007/978-3-030-43024-5_5
[4]  
Foerster J, 2017, PR MACH LEARN RES, V70
[5]  
Foerster JN, 2016, ADV NEUR IN, V29
[6]  
Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
[7]   An overview of the IRIDIUM® low earth orbit (LEO) satellite system [J].
Fossa, CE ;
Raines, RA ;
Gunsch, GH ;
Temple, MA .
PROCEEDINGS OF THE IEEE 1998 NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, 1998, :152-159
[8]  
Haarnoja T., 2018, Soft actor-critic algorithms and applications
[9]   An Efficient Ciphertext-Policy Attribute-Based Encryption Scheme Supporting Collaborative Decryption With Blockchain [J].
He, Ying ;
Wang, Haiyan ;
Li, Yuan ;
Huang, Ke ;
Leung, Victor C. M. ;
Yu, F. Richard ;
Ming, Zhong .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (04) :2722-2733
[10]   Blockchain-Based Edge Computing Resource Allocation in IoT: A Deep Reinforcement Learning Approach [J].
He, Ying ;
Wang, Yuhang ;
Qiu, Chao ;
Lin, Qiuzhen ;
Li, Jianqiang ;
Ming, Zhong .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) :2226-2237