Network Parameter Setting for Reinforcement Learning Approaches Using Neural Networks

被引:2
作者
Yamada, Kazuaki [1 ]
机构
[1] Toyo Univ, Fac Sci & Engn, Dept Mech Engn, 2100 Kujirai, Kawagoe, Saitama 3508585, Japan
关键词
reinforcement learning; artificial neural networks; autonomous mobile robot;
D O I
10.20965/jaciii.2011.p0822
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning approaches are attracting attention as a technique for constructing a trial-and-error mapping function between sensors and motors of an autonomous mobile robot. Conventional reinforcement learning approaches use a look-up table to express the mapping function between grid state and grid action spaces. The grid size greatly adversely affects the learning performance of reinforcement learning algorithms. To avoid this, researchers have proposed reinforcement learning algorithms using neural networks to express the mapping function between continuous state space and action. A designer, however, must set the number of middle neurons and initial values of weight parameters appropriately to improve the approximate accuracy of neural networks. This paper proposes a new method that automatically sets the number of middle neurons and initial values of weight parameters based on the dimension number of the sensor space. The feasibility of proposed method is demonstrated using an autonomous mobile robot navigation problem and is evaluated by comparing it with two types of Q-learning as follows: Q-learning using RBF networks and Q-learning using neural networks whose parameters are set by a designer.
引用
收藏
页码:822 / 830
页数:9
相关论文
共 50 条
  • [41] Application of reinforcement learning and neural network in robot navigation
    孟伟
    洪炳熔
    Journal of Harbin Institute of Technology, 2001, (03) : 283 - 286
  • [42] Investigating the properties of neural network representations in reinforcement learning
    Wang, Han
    Miahi, Erfan
    White, Martha
    Machado, Marlos C.
    Abbas, Zaheer
    Kumaraswamy, Raksha
    Liu, Vincent
    White, Adam
    ARTIFICIAL INTELLIGENCE, 2024, 330
  • [43] Optical neural networks trained in situ with reinforcement learning
    Neill, Oliver
    Faccio, Daniele
    MACHINE LEARNING IN PHOTONICS, 2024, 13017
  • [44] Nonlinear Control System with Reinforcement Learning and Neural Networks Based Lyapunov Functions
    Rego, Rosana Cibely Batista
    Araujo, Fabio Meneghetti Ugulino de
    IEEE LATIN AMERICA TRANSACTIONS, 2021, 19 (08) : 1253 - 1260
  • [45] Forming Adversarial Example Attacks Against Deep Neural Networks With Reinforcement Learning
    Akers, Matthew
    Barton, Armon
    COMPUTER, 2024, 57 (01) : 88 - 99
  • [46] Graph Partitioning and Sparse Matrix Ordering using Reinforcement Learning and Graph Neural Networks
    Gatti, Alice
    Hu, Zhixiong
    Smidt, Tess
    Ng, Esmond G.
    Ghysels, Pieter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [47] Simulating Human Behavior in Fighting Games using Reinforcement Learning and Artificial Neural Networks
    Mendonca, Matheus R. F.
    Bernardino, Heder S.
    Neto, Raul F.
    2015 14TH BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES), 2016, : 152 - 159
  • [48] Car Sales Prediction Using Gated Recurrent Units Neural Networks with Reinforcement Learning
    Zhu, Bowen
    Dong, Huailong
    Zhang, Jing
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 312 - 324
  • [49] Robust reinforcement learning control using integral quadratic constraints for recurrent neural networks
    Anderson, Charles W.
    Young, Peter Michael
    Buehner, Michael R.
    Knight, James N.
    Bush, Keith A.
    Hittle, Douglas C.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (04): : 993 - 1002
  • [50] Using Federated Learning Techniques to Generalize Reinforcement Learning Approaches
    Tellaeche Iglesias, Alberto
    Fidalgo Astorquia, Ignacio
    Vazquez, Juan-Ignacio
    Gaviria de la Puerta, Jose
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT II, HAIS 2024, 2025, 14858 : 292 - 303