Scheduling UAV Swarm with Attention-based Graph Reinforcement Learning for Ground-to-air Heterogeneous Data Communication

被引:7
|
作者
Ren, Jiyuan [1 ]
Xu, Yanggang [1 ]
Li, Zuxin [1 ]
Hong, Chaopeng [2 ]
Zhang, Xiao-Ping [3 ,4 ]
Chen, Xinlei [5 ]
机构
[1] Tsinghua Univ, Tsinghua Berkeley Shenzhen Inst, Shenzhen, Peoples R China
[2] Tsinghua Univ, Shenzhen Key Lab Ecol Remediat & Carbon Sequestra, Inst Environm & Ecol, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[3] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, RISC V Int Open Source Lab, Shenzhen, Peoples R China
[4] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON, Canada
[5] Tsinghua Univ, Shenzhen Int Grad Sch, Pengcheng Lab, RISC V Int Open Source Lab, Shenzhen, Peoples R China
来源
ADJUNCT PROCEEDINGS OF THE 2023 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING & THE 2023 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTING, UBICOMP/ISWC 2023 ADJUNCT | 2023年
基金
国家重点研发计划;
关键词
Disaster Response; Path Planning; Graph Neural Network; Model-based Reinforcement Learning;
D O I
10.1145/3594739.3612905
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In disaster scenarios, unmanned aerial vehicles (UAVs) can serve as mobile base stations because of their maneuverability and synergy. However, due to constrained UAV communication capabilities and limited battery life, UAV base stations resource allocation for mobile sensors in a data-heterogeneous environment is a significant challenge when optimizing communication quality. To address this, we propose AGUZero, an attention-based graph reinforcement learning (RL) framework. Inspired by MuZero [27], AGUZero is designed to handle dynamic and uncontrollable environments based on Monte Carlo Tree Search (MCTS). Additionally, to tackle data heterogeneity, AGUZero represents the states using heterogeneous sub-graphs and employs an attention-based model to capture relationships among UAVs and sensors. The experimental results show that AGUZero outperforms other baseline models consistently when either the number of UAVs or the number of sensors is varying. AGUZero improves the data transmission ratio by 11.03% and 10.35% in the two cases respectively.
引用
收藏
页码:670 / 675
页数:6
相关论文
共 50 条
  • [1] Reinforcement Learning based Scheduling for Heterogeneous UAV Networking
    Wang, Jian
    Liu, Yongxin
    Niu, Shuteng
    Song, Houbing
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 420 - 427
  • [2] Attention-Based Multiagent Graph Reinforcement Learning for Service Restoration
    Fan B.
    Liu X.
    Xiao G.
    Kang Y.
    Wang D.
    Wang P.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2163 - 2178
  • [3] Heterogeneous mission planning for a single unmanned aerial vehicle (UAV) with attention-based deep reinforcement learning
    Jung, Minjae
    Oh, Hyondong
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [4] Heterogeneous mission planning for a single unmanned aerial vehicle (UAV) with attention-based deep reinforcement learning
    Jung M.
    Oh H.
    PeerJ Computer Science, 2022, 8
  • [5] An Attention-Based Graph Neural Network for Heterogeneous Structural Learning
    Hong, Huiting
    Guo, Hantao
    Lin, Yucheng
    Yang, Xiaoqing
    Li, Zang
    Ye, Jieping
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4132 - 4139
  • [6] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
    Liu, Jia-yi
    Wang, Gang
    Fu, Qiang
    Yue, Shao-hua
    Wang, Si-yuan
    DEFENCE TECHNOLOGY, 2023, 19 : 210 - 219
  • [7] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
    Jia-yi Liu
    Gang Wang
    Qiang Fu
    Shao-hua Yue
    Si-yuan Wang
    Defence Technology, 2023, 19 (01) : 210 - 219
  • [8] ADRL: An attention-based deep reinforcement learning framework for knowledge graph reasoning
    Wang, Qi
    Hao, Yongsheng
    Cao, Jie
    KNOWLEDGE-BASED SYSTEMS, 2020, 197
  • [9] A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning
    Wiggins, Samuel
    Meng, Yuan
    Iyer, Mahesh A.
    Prasanna, Viktor
    2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 236 - 242
  • [10] Graph-Attention-Based Reinforcement Learning for Trajectory Design and Resource Assignment in Multi-UAV-Assisted Communication
    Feng, Zikai
    Wu, Di
    Huang, Mengxing
    Yuen, Chau
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (16): : 27421 - 27434