A Huber reward function-driven deep reinforcement learning solution for cart-pole balancing problem

被引:2
|
作者
Mishra, Shaili [1 ]
Arora, Anuja [1 ]
机构
[1] Jaypee Inst Informat Technol, Dept Comp Sci Engn & Informat Technol, Noida, India
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 23期
关键词
Reinforcement learning; Cart-pole problem; Q-learning; Deep Q learning; DQN; Double DQN; INVERTED PENDULUM; DESIGN;
D O I
10.1007/s00521-022-07606-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lots of learning tasks require experience learning based on activities performed in real scenarios which are affected by environmental factors. Therefore, real-time systems demand a model to learn from working experience-such as physical object properties-driven system models, trajectory prediction, and Atari games. This experience-driven learning model uses reinforcement learning which is considered as an important research topic and needs problem-specific reasoning model simulation. In this research paper, cart-pole balancing problem is selected as a problem where the system learns using Q-learning and Deep Q network reinforcement learning approaches. Pragmatic foundation of cart-pole problem and its solution with the help of Q learning and DQN reinforcement learning model are validated, and a comparison of achieved outcome in the form of accuracy and fast convergence is presented. An unexperienced Huber loss function is applied on cart-pole balancing problem, and results are in favor of Huber loss function in comparison with mean-squared error loss function. Hence, experimental study suggests the use of DQN with Huber loss reward function for fast learning and convergence of cart pole in balanced condition.
引用
收藏
页码:16705 / 16722
页数:18
相关论文
共 13 条
  • [1] A Huber reward function-driven deep reinforcement learning solution for cart-pole balancing problem
    Shaili Mishra
    Anuja Arora
    Neural Computing and Applications, 2023, 35 : 16705 - 16722
  • [2] Comparison of Reinforcement Learning Algorithms applied to the Cart-Pole Problem
    Nagendra, Savinay
    Podila, Nikhil
    Ugarakhod, Rashmi
    George, Koshy
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 26 - 32
  • [3] A Parametric Study of a Deep Reinforcement Learning Control System Applied to the Swing-Up Problem of the Cart-Pole
    Manrique Escobar, Camilo Andres
    Pappalardo, Carmine Maria
    Guida, Domenico
    APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 19
  • [4] Opponent cart-pole dynamics for reinforcement learning of competing agents
    Huang, Xun
    ACTA MECHANICA SINICA, 2022, 38 (05)
  • [5] A Comparison Study Between Deep Learning and Genetic Programming Application in Cart Pole Balancing Problem
    Miranda, Icaro Marcelino
    Ladeira, Marcelo
    Aranha, Claus de Castro
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 1657 - 1663
  • [6] Occupancy Reward-Driven Exploration with Deep Reinforcement Learning for Mobile Robot System
    Kamalova, Albina
    Lee, Suk Gyu
    Kwon, Soon Hak
    APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [7] Data Driven Solution to Market Equilibrium via Deep Reinforcement Learning
    Wen, Lin
    Wang, Jianxiao
    Lin, Li
    Zou, Yang
    Gao, Feng
    Hong, Qiteng
    2024 IEEE 2ND INTERNATIONAL CONFERENCE ON POWER SCIENCE AND TECHNOLOGY, ICPST 2024, 2024, : 1422 - 1426
  • [8] Opponent cart-pole dynamics for reinforcement learning of competing agents面向竞争多智能体增强学习的对抗倒立摆动力学系统
    Xun Huang
    Acta Mechanica Sinica, 2022, 38
  • [9] Transformable Gaussian Reward Function for Socially Aware Navigation Using Deep Reinforcement Learning
    Kim, Jinyeob
    Kang, Sumin
    Yang, Sungwoo
    Kim, Beomjoon
    Yura, Jargalbaatar
    Kim, Donghan
    SENSORS, 2024, 24 (14)
  • [10] Applying Deep Reinforcement Learning to Cable Driven Parallel Robots for Balancing Unstable Loads: A Ball Case Study
    Grimshaw, Alex
    Oyekan, John
    FRONTIERS IN ROBOTICS AND AI, 2021, 7