A Cyclic Hyper-parameter Selection Approach for Reinforcement Learning-based UAV Path Planning

被引:0
作者
Jones, Michael R. [1 ]
Djahel, Soufiene [2 ]
Welsh, Kristopher [1 ]
机构
[1] Manchester Metropolitan Univ, Dept Comp & Math, Manchester, Lancs, England
[2] Univ Huddersfield, Sch Comp & Engn, Huddersfield, W Yorkshire, England
来源
2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC | 2024年
关键词
UAV; unmanned aerial vehicles; reinforcement learning; q-learning; hyper-parameters; path planning; self-tuning;
D O I
10.1109/CCNC51664.2024.10454801
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned Aerial Vehicles (UAVs) offer new ways to fulfil a variety of urban transportation and service solutions. The ability to successfully plan and re-plan paths across a complex urban environment remains an unsolved significant problem. New Q-learning approaches have potential to address this problem, however they must first learn complex environment spaces. A traditional challenge within this field is the selection of suitable learning hyper-parameters that assist a Q-learning algorithm in achieving an optimal learning policy. It is known that testing and evaluating multiple hyper-parameter combinations is computationally expensive. Thus, this paper proposes a new method for hyper-parameter self-tuning, cyclically assigning hyper-parameters within a single learning process, eliminating the need to experimentally seek optimal hyper-parameter value combinations. Evaluation of the captured results show, training with cyclical hyper-parameter exploration instead of fixed values, achieves improved path generation, while reducing the cumulative learning time required. Although the focus of this approach is centred around a Multi Q-table Path Planning solution, this work presents a practical tool applicable to Reinforcement Learning techniques generally.
引用
收藏
页码:792 / 798
页数:7
相关论文
共 14 条
[1]  
Bengio Yoshua, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P437, DOI 10.1007/978-3-642-35289-8_26
[2]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[3]  
Barsce JC, 2017, 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI)
[4]  
Fran‡ois-Lavet V, 2016, Arxiv, DOI arXiv:1512.02011
[5]   Beyond Manual Tuning of Hyperparameters [J].
Hutter, Frank ;
Luecke, Joerg ;
Schmidt-Thieme, Lars .
KUNSTLICHE INTELLIGENZ, 2015, 29 (04) :329-337
[6]  
Jones M. R., 2022, P IEEE 19 ANN CONS C, P457
[7]  
Lu Z., 2019, arXiv
[8]  
Breuel TM, 2015, Arxiv, DOI [arXiv:1508.02788, DOI 10.48550/ARXIV.1508.02788]
[9]  
Oller D, 2020, Arxiv, DOI arXiv:2004.07707
[10]   Cyclical Learning Rates for Training Neural Networks [J].
Smith, Leslie N. .
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :464-472