Ensuring Threshold AoI for UAV-Assisted Mobile Crowdsensing by Multi-Agent Deep Reinforcement Learning With Transformer

被引：14

作者：

Wang, Hao ^{[1
]}

Liu, Chi Harold ^{[1
]}

Yang, Haoming ^{[1
]}

Wang, Guoren ^{[1
]}

Leung, Kin K. ^{[2
,3
]}

机构：

[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China

[2] Imperial Coll, Elect & Elect Engn EEE Dept, London SW7 2BT, England

[3] Imperial Coll, Comp Dept, London SW7 2BT, England

来源：

IEEE-ACM TRANSACTIONS ON NETWORKING | 2024年 / 32卷 / 01期

基金：

中国国家自然科学基金;

关键词：

~UAV crowdsensing; AoI; multi-agent deep reinforcement learning; transformer; DESIGN; OPTIMIZATION; COVERAGE;

D O I：

10.1109/TNET.2023.3289172

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Unmanned aerial vehicle (UAV) crowdsensing (UCS) is an emerging data collection paradigm to provide reliable and high quality urban sensing services, with age-of information (AoI) requirement to measure data freshness in real-time applications. In this paper, we explicitly consider the case to ensure that the attained AoI always stay within a specific threshold. The goal is to maximize the total amount of collected data from diverse Point-of-Interests (PoIs) while minimizing AoI and AoI threshold violation ratio under limited energy supplement. To this end, we propose a decentralized multi-agent deep reinforcement learning framework called "DRL-UCS(AoI(th))" for multi-UAV trajectory planning, which consists of a novel transformer-enhanced distributed architecture and an adaptive intrinsic reward mechanism for spatial cooperation and exploration. Extensive results and trajectory visualization on two real-world datasets in Beijing and San Francisco show that, DRLUCS(AoI(th)) consistently outperforms all nine baselines when varying the number of UAVs, AoI threshold and generated data amount in a timeslot.

引用

页码：566 / 581

页数：16

共 65 条

[1]

Ba L. J., 2016, arXiv

[2]

Bellalta B, 2016, IEEE WIREL COMMUN, V23, P38, DOI 10.1109/MWC.2016.7422404

[3]

Berner Christopher, 2019, Dota 2 with large scale deep reinforcement learning

[4]

Burda Y., 2019, 7th International Conference on Learning Representations, ICLR 2019, P1

[5] The rise of people-centric sensing [J].

Campbell, Andrew T. ;

Lane, Nicholas D. ;

Miluzzo, Emiliano ;

Peterson, Ronald A. ;

Lu, Hong ;

Zheng, Xiao ;

Musolesi, Mirco ;

Fodor, Kristof ;

Ahn, Gahng-Seop ;

Eisenman, Shane B. .

IEEE INTERNET COMPUTING, 2008, 12 (04) :12-21

[6]

Chen XY, 2019, ADV NEUR IN, V32

[7]

Chung J., 2014, NEURAL INFORM PROCES

[8]

Costa P. R. D. O., 2020, ARXIV

[9]

Dai Z., 2021, Adv. Neural. Inf. Process. Syst, V34, P1

[10]

Dai ZH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2978

← 1 2 3 4 5 6 7 →