A Huber reward function-driven deep reinforcement learning solution for cart-pole balancing problem

被引：2

作者：

Mishra, Shaili ^{[1
]}

Arora, Anuja ^{[1
]}

机构：

[1] Jaypee Inst Informat Technol, Dept Comp Sci Engn & Informat Technol, Noida, India

来源：

NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 23期

关键词：

Reinforcement learning; Cart-pole problem; Q-learning; Deep Q learning; DQN; Double DQN; INVERTED PENDULUM; DESIGN;

D O I：

10.1007/s00521-022-07606-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Lots of learning tasks require experience learning based on activities performed in real scenarios which are affected by environmental factors. Therefore, real-time systems demand a model to learn from working experience-such as physical object properties-driven system models, trajectory prediction, and Atari games. This experience-driven learning model uses reinforcement learning which is considered as an important research topic and needs problem-specific reasoning model simulation. In this research paper, cart-pole balancing problem is selected as a problem where the system learns using Q-learning and Deep Q network reinforcement learning approaches. Pragmatic foundation of cart-pole problem and its solution with the help of Q learning and DQN reinforcement learning model are validated, and a comparison of achieved outcome in the form of accuracy and fast convergence is presented. An unexperienced Huber loss function is applied on cart-pole balancing problem, and results are in favor of Huber loss function in comparison with mean-squared error loss function. Hence, experimental study suggests the use of DQN with Huber loss reward function for fast learning and convergence of cart pole in balanced condition.

引用

页码：16705 / 16722

页数：18

共 13 条

[1] A Huber reward function-driven deep reinforcement learning solution for cart-pole balancing problem
Shaili Mishra
Anuja Arora
Neural Computing and Applications, 2023, 35 : 16705 - 16722
[2] Comparison of Reinforcement Learning Algorithms applied to the Cart-Pole Problem
Nagendra, Savinay
Podila, Nikhil
Ugarakhod, Rashmi
George, Koshy
2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 26 - 32
[3] A Parametric Study of a Deep Reinforcement Learning Control System Applied to the Swing-Up Problem of the Cart-Pole
Manrique Escobar, Camilo Andres
Pappalardo, Carmine Maria
Guida, Domenico
APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 19
[4] Opponent cart-pole dynamics for reinforcement learning of competing agents
Huang, Xun
ACTA MECHANICA SINICA, 2022, 38 (05)
[5] A Comparison Study Between Deep Learning and Genetic Programming Application in Cart Pole Balancing Problem
Miranda, Icaro Marcelino
Ladeira, Marcelo
Aranha, Claus de Castro
2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 1657 - 1663
[6] Occupancy Reward-Driven Exploration with Deep Reinforcement Learning for Mobile Robot System
Kamalova, Albina
Lee, Suk Gyu
Kwon, Soon Hak
APPLIED SCIENCES-BASEL, 2022, 12 (18):
[7] Data Driven Solution to Market Equilibrium via Deep Reinforcement Learning
Wen, Lin
Wang, Jianxiao
Lin, Li
Zou, Yang
Gao, Feng
Hong, Qiteng
2024 IEEE 2ND INTERNATIONAL CONFERENCE ON POWER SCIENCE AND TECHNOLOGY, ICPST 2024, 2024, : 1422 - 1426
[8] Opponent cart-pole dynamics for reinforcement learning of competing agents面向竞争多智能体增强学习的对抗倒立摆动力学系统
Xun Huang
Acta Mechanica Sinica, 2022, 38
[9] Transformable Gaussian Reward Function for Socially Aware Navigation Using Deep Reinforcement Learning
Kim, Jinyeob
Kang, Sumin
Yang, Sungwoo
Kim, Beomjoon
Yura, Jargalbaatar
Kim, Donghan
SENSORS, 2024, 24 (14)
[10] Applying Deep Reinforcement Learning to Cable Driven Parallel Robots for Balancing Unstable Loads: A Ball Case Study
Grimshaw, Alex
Oyekan, John
FRONTIERS IN ROBOTICS AND AI, 2021, 7

← 1 2 →