Trajectory Optimization for Autonomous Flying Base Station via Reinforcement Learning

被引:0
|
作者
Bayerlein, Harald [1 ]
de Kerret, Paul [1 ]
Gesbert, David [1 ]
机构
[1] EURECOM, Commun Syst Dept, Sophia Antipolis, France
来源
2018 IEEE 19TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC) | 2018年
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this work, we study the optimal trajectory of an unmanned aerial vehicle (UAV) acting as a base station (BS) to serve multiple users. Considering multiple flying epochs, we leverage the tools of reinforcement learning (RL) with the UAV acting as an autonomous agent in the environment to learn the trajectory that maximizes the sum rate of the transmission during flying time. By applying Q-learning, a model-free RL technique, an agent is trained to make movement decisions for the UAV. We compare table-based and neural network (NN) approximations of the Q-function and analyze the results. In contrast to previous works, movement decisions are directly made by the neural network and the algorithm requires no explicit information about the environment and is able to learn the topology of the network to improve the system-wide performance.
引用
收藏
页码:945 / 949
页数:5
相关论文
共 50 条
  • [1] Optimizing Flying Base Station Connectivity by RAN Slicing and Reinforcement Learning
    Melgarejo, Dick Carrillo
    Pokorny, Jiri
    Seda, Pavel
    Narayanan, Arun
    Nardelli, Pedro H. J.
    Rasti, Mehdi
    Hosek, Jiri
    Seda, Milos
    Rodriguez, Demostenes Z.
    Koucheryavy, Yevgeni
    Fraidenraich, Gustavo
    IEEE ACCESS, 2022, 10 : 53746 - 53760
  • [2] Online Trajectory Optimization for the UAV-Enabled Base Station Multicasting System Based on Reinforcement Learning
    Zhang Guangchi
    Yan Yulin
    Cui Miao
    Chen Wei
    Zhang Jing
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (03) : 969 - 975
  • [3] Efficient Reinforcement Learning via Probabilistic Trajectory Optimization
    Pan, Yunpeng
    Boutselis, George, I
    Theodorou, Evangelos A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5459 - 5474
  • [4] Learning to Rest: A Q-Learning Approach to Flying Base Station Trajectory Design with Landing Spots
    Bayerlein, Harald
    Gangula, Rajeev
    Gesbert, David
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 724 - 728
  • [5] Reinforcement Learning Aided UAV Base Station Location Optimization for Rate Maximization
    Gopi, Sudheesh Puthenveettil
    Magarini, Maurizio
    ELECTRONICS, 2021, 10 (23)
  • [6] Mobility-Aware Trajectory Design for Aerial Base Station Using Deep Reinforcement Learning
    Hao, Guoliang
    Ni, Wanli
    Tian, Hui
    Cao, Leilei
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1131 - 1136
  • [7] Autonomous Navigation and Configuration of Integrated Access Backhauling for UAV Base Station Using Reinforcement Learning
    Zhang, Hongyi
    Li, Jingya
    Qi, Zhiqiang
    Lin, Xingqin
    Aronsson, Anders
    Bosch, Jan
    Olsson, Helena Holmstrom
    2022 IEEE FUTURE NETWORKS WORLD FORUM, FNWF, 2022, : 184 - 189
  • [8] Optimization of Cell Individual Offset for Handover of Flying Base Station
    Madelkhanova, Aida
    Becvar, Zdenek
    2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
  • [9] Autonomous flying of drone based on ppo reinforcement learning algorithm
    Park S.G.
    Kim D.H.
    Kim, Dong Hwan (dhkim@seoultech.ac.kr), 1600, Institute of Control, Robotics and Systems (26): : 955 - 963
  • [10] Reinforcement learning optimization for base station sleeping strategy in coordinated multipoint (CoMP) communications
    Wen, Shuhuan
    Hu, Baozhu
    Lam, H. K.
    NEUROCOMPUTING, 2015, 167 : 443 - 450