Interactive Reinforcement Learning Strategy

被引:1
作者
Shi, Zhenjie [1 ]
Ma, Wenming [1 ]
Yin, Shuai [1 ]
Zhang, Hailiang [1 ]
Zhao, Xiaofan [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai, Peoples R China
来源
2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021) | 2021年
关键词
Reinforcement learning; interactive learning; path planning; Q-learning;
D O I
10.1109/SWC50871.2021.00075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The birth of AlphaGo has set off a new wave of reinforcement learning technology. Reinforcement learning has become one of the most popular directions in the field of artificial intelligence. Its essence is the continuous integration and upgrading of various machine learning methods, and the agents continue to trial and error and obtain cumulative rewards. Q-learning is the most commonly used method in reinforcement learning, but it itself has many problems such as less early information, long learning time, low learning efficiency, and repeated trial and error. Therefore, Q-learning cannot be directly applied to the real environment. In response to this problem, the reinforcement learning discussed by the author is an interactive learning method that combines voice commands and Q-learning. This method uses part of the interaction between the agent and the human voice to find a larger target range in the early stage of learning. Then narrow the search range in turn, which can guide the agent to quickly achieve the learning effect and change the blindness of learning. Simulation experiments show that compared with the standard Q-learning algorithm, the proposed algorithm not only improves the convergence speed, shortens the learning time, but also reduces the number of collisions, enabling the agent to quickly find a better collision-free path.
引用
收藏
页码:507 / 512
页数:6
相关论文
共 50 条
  • [21] Nonstrict Hierarchical Reinforcement Learning for Interactive Systems and Robots
    Cuayahuitl, Heriberto
    Kruijff-Korbayova, Ivana
    Dethlefs, Nina
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2014, 4 (03)
  • [22] Reinforcement Learning-Based Interactive Video Search
    Ma, Zhixin
    Wu, Jiaxin
    Hou, Zhijian
    Ngo, Chong-Wah
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 549 - 555
  • [23] Persistent rule-based interactive reinforcement learning
    Bignold, Adam
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    Foale, Cameron
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32) : 23411 - 23428
  • [24] An Evaluation Methodology for Interactive Reinforcement Learning with Simulated Users
    Bignold, Adam
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    Foale, Cameron
    BIOMIMETICS, 2021, 6 (01) : 1 - 15
  • [25] Reinforcement learning of ballistic maneuver adjustment strategy after missile penetration
    Fan B.
    Chen G.
    Han L.
    Li B.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2024, 46 (02): : 94 - 103
  • [26] Information Release Strategy of Urban Rail Transit Based on Reinforcement Learning
    Jia F.-F.
    Jiang X.
    Li H.-Y.
    Yu X.-Q.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2020, 20 (05): : 72 - 78
  • [27] Reinforcement learning with modified exploration strategy for mobile robot path planning
    Khlif, Nesrine
    Nahla, Khraief
    Safya, Belghith
    ROBOTICA, 2023, 41 (09) : 2688 - 2702
  • [28] Intelligent Algorithmic Trading Strategy Using Reinforcement Learning and Directional Change
    Aloud, Monira Essa
    Alkhamees, Nora
    IEEE ACCESS, 2021, 9 : 114659 - 114671
  • [29] Reinforcement-Learning-Based Path Planning: A Reward Function Strategy
    Jaramillo-Martinez, Ramon
    Chavero-Navarrete, Ernesto
    Ibarra-Perez, Teodoro
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [30] The vehicle speed strategy with double traffic lights based on reinforcement learning
    Chen, Kaixuan
    Wu, Guangqiang
    Peng, Shang
    Zeng, Xiang
    Ju, Lijuan
    INTERNATIONAL JOURNAL OF VEHICLE PERFORMANCE, 2023, 9 (03) : 250 - 271