A DEEP REINFORCEMENT LEARNING APPROACH TO FLOCKING AND NAVIGATION OF UAVS IN LARGE-SCALE COMPLEX ENVIRONMENTS

被引:0
|
作者
Wang, Chao [1 ]
Wang, Jian [1 ]
Zhang, Xudong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
来源
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018) | 2018年
关键词
UAV flocking; UAV navigation; flocking control; deep reinforcement learning; AGENTS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper aims at enabling unmanned aerial vehicles (UAV) to flock and meanwhile perform navigation tasks in large-scale complex environments in a fully decentralized manner. By incorporating the insights of flocking control inspired by bird flocking in nature, the problem is structured as a Markov decision process and solved by deep reinforcement learning. In particular, coordination among agents is achieved by following a local interaction protocol that each agent only considers the relative position of the nearest two neighbors on its left side and right side. In addition, a flocking control-inspired reward scheme is designed for the emergence of flocking and navigation behaviors. Simulation results demonstrate that by training with three UAVs, the learned policy, shared across all agents, can enable a larger number of UAVs to perform navigation tasks as a group in large-scale complex environments.
引用
收藏
页码:1228 / 1232
页数:5
相关论文
共 50 条
  • [41] Indoor Navigation with Deep Reinforcement Learning
    Bakale, Vijayalakshmi A.
    Kumar, Yeshwanth V. S.
    Roodagi, Vivekanand C.
    Kulkarni, Yashaswini N.
    Patil, Mahesh S.
    Chickerur, Satyadhyan
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 660 - 665
  • [42] End-to-end Decentralized Multi-robot Navigation in Unknown Complex Environments via Deep Reinforcement Learning
    Lin, Juntong
    Yang, Xuyun
    Zheng, Peiwei
    Cheng, Hui
    2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 2499 - 2506
  • [43] Engineering A Large-Scale Traffic Signal Control: A Multi-Agent Reinforcement Learning Approach
    Chen, Yue
    Li, Changle
    Yue, Wenwei
    Zhang, Hehe
    Mao, Guoqiang
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [44] An Adaptive Metadata Management Scheme Based on Deep Reinforcement Learning for Large-Scale Distributed File Systems
    Huang, Xiuqi
    Gao, Yuanning
    Zhou, Xinyi
    Gao, Xiaofeng
    Chen, Guihai
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (06) : 2840 - 2853
  • [45] Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems
    Li, Jingbo
    Zhang, Xingjun
    Wei, Zheng
    Wei, Jia
    Ji, Zeyu
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2021, 3 (04) : 383 - 392
  • [46] A Spatial-Temporal Deep Reinforcement Learning Model for Large-Scale Centralized Traffic Signal Control
    Yi, Chenglin
    Wu, Jia
    Ren, Yanyu
    Ran, Yunchuan
    Lou, Yican
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 275 - 280
  • [47] Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems
    Jingbo Li
    Xingjun Zhang
    Zheng Wei
    Jia Wei
    Zeyu Ji
    CCF Transactions on High Performance Computing, 2021, 3 : 383 - 392
  • [48] Multi-task deep reinforcement learning for dynamic scheduling of large-scale fleets in earthmoving operations
    Zhang, Yunuo
    Zhang, Jun
    Wang, Xiaoling
    Zeng, Tuocheng
    AUTOMATION IN CONSTRUCTION, 2025, 174
  • [49] DRESIA: Deep Reinforcement Learning-Enabled Gray Box Approach for Large-Scale Dynamic Cyber-Twin System Simulation
    Lin, Zhouyang
    Li, Kai
    Yang, Yang
    Sun, Fanglei
    Wu, Liantao
    Shi, Panpan
    Ci, Song
    Zuo, Yong
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2021, 2 : 321 - 333
  • [50] Information-theoretic sensor planning for large-scale production surveillance via deep reinforcement learning
    Tewari, Ashutosh
    Liu, Kuang-Hung
    Papageorgiou, Dimitri
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 141