A Navigation Algorithm Based on the Reinforcement Learning Reward System and Optimised with Genetic Algorithm

被引:0
|
作者
Cabezas-Olivenza, Mireya [1 ]
Zulueta, Ekaitz [2 ]
Azurmendi-Marquinez, Iker [3 ]
Fernandez-Gamiz, Unai [4 ]
Rico-Melgosa, Danel [2 ]
机构
[1] Mondragon Unibertsitatea, Fac Engn, Arrasate Mondragon 20500, Spain
[2] Univ Basque Country UPV EHU, Syst Engn & Automat Control Dept, Nieves Cano 12, Vitoria 01006, Spain
[3] CS Ctr Stirling S Coop, Avda Alava 3, Aretxabaleta 20550, Spain
[4] Univ Basque Country UPV EHU, Dept Energy Engn, Nieves Cano 12, Vitoria 01006, Spain
关键词
navigation; reinforcement learning; genetic algorithm; optimisation; autonomous vehicle; q-learning; AGV; MOBILE ROBOT NAVIGATION;
D O I
10.3390/math12244030
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Regarding autonomous vehicle navigation, reinforcement learning is a technique that has demonstrated significant results. Nevertheless, it is a technique with a high number of parameters that need to be optimised without prior information, and correctly performing this is a complicated task. In this research study, a system based on the principles of reinforcement learning, specifically on the concept of rewards, is presented. A mathematical expression was proposed to control the vehicle's direction based on its position, the obstacles in the environment and the destination. In this equation proposal, there was only one unknown parameter that regulated the degree of the action to be taken, and this was optimised through the genetic algorithm. In this way, a less computationally expensive navigation algorithm was presented, as it avoided the use of neural networks. The controller's time to obtain the navigation instructions was around 6.201<middle dot>10-4 s. This algorithm is an efficient and accurate system which manages not to collide with obstacles and to reach the destination from any position. Moreover, in most cases, it has been found that the proposed navigations are also optimal.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] RLGA: A reinforcement learning based genetic algorithm
    Wang, Ben-Nian
    Gao, Yang
    Chen, Zhao-Qian
    Xie, Jun-Yuan
    Chen, Shi-Fu
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2006, 34 (05): : 856 - 860
  • [2] A Hierarchical Reinforcement Learning Algorithm Based On Heuristic Reward Function
    Yan, Qicui
    Liu, Quan
    Hu, Daojing
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 3, 2010, : 371 - 376
  • [3] A genetic algorithm for reinforcement learning
    Zhao, L
    Liu, ZM
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 1056 - 1060
  • [4] Meta-Reinforcement Learning Algorithm Based on Reward and Dynamic Inference
    Chen, Jinhao
    Zhang, Chunhong
    Hu, Zheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 223 - 234
  • [5] An investor sentiment reward-based trading system using Gaussian inverse reinforcement learning algorithm
    Yang, Steve Y.
    Yu, Yangyang
    Almandi, Saud
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 388 - 401
  • [6] An Average-Reward Reinforcement Learning Algorithm based on Schweitzer's Transformation
    Li Jianjun
    Ren Jiangong
    Li Yanjie
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 2966 - 2970
  • [7] The Task Decomposition and Dedicated Reward-System-Based Reinforcement Learning Algorithm for Pick-and-Place
    Kim, Byeongjun
    Kwon, Gunam
    Park, Chaneun
    Kwon, Nam Kyu
    BIOMIMETICS, 2023, 8 (02)
  • [8] Mobile Robot Navigation System Using Reinforcement Learning with Path Planning Algorithm
    Andarge, E.W.
    Ordys, A.
    Abebe, Y.M.
    Acta Physica Polonica A, 2024, 146 (04) : 452 - 456
  • [9] Determination of the optimal values of parameters in reinforcement learning for mobile robot navigation by a genetic algorithm
    Kamei, K
    Ishikawa, M
    BRAIN-INSPIRED IT I, 2004, 1269 : 193 - 196
  • [10] An Acquisition Based Optimised Crop Recommendation System with Machine Learning Algorithm
    Choudhury, Sasmita Subhadarsinee
    Pandharbale, Priya B.
    Mohanty, Sachi Nandan
    Jagdev, Alok Kumar
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01)