Trajectory Optimization for Autonomous Flying Base Station via Reinforcement Learning

被引:0
|
作者
Bayerlein, Harald [1 ]
de Kerret, Paul [1 ]
Gesbert, David [1 ]
机构
[1] EURECOM, Commun Syst Dept, Sophia Antipolis, France
来源
2018 IEEE 19TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC) | 2018年
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this work, we study the optimal trajectory of an unmanned aerial vehicle (UAV) acting as a base station (BS) to serve multiple users. Considering multiple flying epochs, we leverage the tools of reinforcement learning (RL) with the UAV acting as an autonomous agent in the environment to learn the trajectory that maximizes the sum rate of the transmission during flying time. By applying Q-learning, a model-free RL technique, an agent is trained to make movement decisions for the UAV. We compare table-based and neural network (NN) approximations of the Q-function and analyze the results. In contrast to previous works, movement decisions are directly made by the neural network and the algorithm requires no explicit information about the environment and is able to learn the topology of the network to improve the system-wide performance.
引用
收藏
页码:945 / 949
页数:5
相关论文
共 50 条
  • [21] The Furtherance of Autonomous Engineering via Reinforcement Learning
    Antensteiner, Doris
    Dietrich, Vincent
    Fiegert, Michael
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (ICINCO), 2021, : 49 - 59
  • [22] Autonomous helicopter flight via reinforcement learning
    Ng, AY
    Kim, HJ
    Jordan, MI
    Sastry, S
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 799 - 806
  • [23] Research on Inverse Reinforcement Learning-Based Trajectory Planning Optimization Mechanism for Autonomous Connected Vehicles
    Peng H.
    Tang M.
    Zha Q.
    Wang C.
    Wang W.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (08): : 820 - 831
  • [24] Autonomous Model Management via Reinforcement Learning
    Liebman, Elad
    Zavesky, Eric
    Stone, Peter
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1601 - 1603
  • [25] Trajectory optimization of spacecraft autonomous far-distance rapid rendezvous based on deep reinforcement learning
    Di, Peng
    Yao, Ye
    Lin, Zheng
    Yin, Zengshan
    ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 790 - 806
  • [26] Autonomous Predictive Modeling via Reinforcement Learning
    Khurana, Udayan
    Samulowitz, Horst
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3285 - 3288
  • [27] Reinforcement-Learning-Based Trajectory Learning in Frenet Frame for Autonomous Driving
    Yoon, Sangho
    Kwon, Youngjoon
    Ryu, Jaesung
    Kim, Sungkwan
    Choi, Sungwoo
    Lee, Kyungjae
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [28] Model-Free Trajectory Optimization for Reinforcement Learning
    Akrour, Riad
    Abdolmaleki, Abbas
    Abdulsamad, Hany
    Neumann, Gerhard
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [29] Reentry trajectory optimization based on Deep Reinforcement Learning
    Gao, Jiashi
    Shi, Xinming
    Cheng, Zhongtao
    Xiong, Jizhang
    Liu, Lei
    Wang, Yongji
    Yang, Ye
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 2588 - 2592
  • [30] Trajectory optimization using reinforcement, learning for map exploration
    Kollar, Thomas
    Roy, Nicholas
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (02): : 175 - 196