Ensuring Threshold AoI for UAV-Assisted Mobile Crowdsensing by Multi-Agent Deep Reinforcement Learning With Transformer

被引:14
作者
Wang, Hao [1 ]
Liu, Chi Harold [1 ]
Yang, Haoming [1 ]
Wang, Guoren [1 ]
Leung, Kin K. [2 ,3 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Imperial Coll, Elect & Elect Engn EEE Dept, London SW7 2BT, England
[3] Imperial Coll, Comp Dept, London SW7 2BT, England
基金
中国国家自然科学基金;
关键词
~UAV crowdsensing; AoI; multi-agent deep reinforcement learning; transformer; DESIGN; OPTIMIZATION; COVERAGE;
D O I
10.1109/TNET.2023.3289172
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicle (UAV) crowdsensing (UCS) is an emerging data collection paradigm to provide reliable and high quality urban sensing services, with age-of information (AoI) requirement to measure data freshness in real-time applications. In this paper, we explicitly consider the case to ensure that the attained AoI always stay within a specific threshold. The goal is to maximize the total amount of collected data from diverse Point-of-Interests (PoIs) while minimizing AoI and AoI threshold violation ratio under limited energy supplement. To this end, we propose a decentralized multi-agent deep reinforcement learning framework called "DRL-UCS(AoI(th))" for multi-UAV trajectory planning, which consists of a novel transformer-enhanced distributed architecture and an adaptive intrinsic reward mechanism for spatial cooperation and exploration. Extensive results and trajectory visualization on two real-world datasets in Beijing and San Francisco show that, DRLUCS(AoI(th)) consistently outperforms all nine baselines when varying the number of UAVs, AoI threshold and generated data amount in a timeslot.
引用
收藏
页码:566 / 581
页数:16
相关论文
共 65 条
[1]  
Ba L. J., 2016, arXiv
[2]  
Bellalta B, 2016, IEEE WIREL COMMUN, V23, P38, DOI 10.1109/MWC.2016.7422404
[3]  
Berner Christopher, 2019, Dota 2 with large scale deep reinforcement learning
[4]  
Burda Y., 2019, 7th International Conference on Learning Representations, ICLR 2019, P1
[5]   The rise of people-centric sensing [J].
Campbell, Andrew T. ;
Lane, Nicholas D. ;
Miluzzo, Emiliano ;
Peterson, Ronald A. ;
Lu, Hong ;
Zheng, Xiao ;
Musolesi, Mirco ;
Fodor, Kristof ;
Ahn, Gahng-Seop ;
Eisenman, Shane B. .
IEEE INTERNET COMPUTING, 2008, 12 (04) :12-21
[6]  
Chen XY, 2019, ADV NEUR IN, V32
[7]  
Chung J., 2014, NEURAL INFORM PROCES
[8]  
Costa P. R. D. O., 2020, ARXIV
[9]  
Dai Z., 2021, Adv. Neural. Inf. Process. Syst, V34, P1
[10]  
Dai ZH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2978