Fuzzy Reinforcement Learning and Curriculum Transfer Learning for Micromanagement in Multi-Robot Confrontation

被引:4
作者
Hu, Chunyang [1 ]
Xu, Meng [2 ]
机构
[1] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang 441053, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
关键词
multi-robot confrontation; fuzzy reinforcement learning; curriculum transfer learning; neural network; INTELLIGENCE; FRAMEWORK;
D O I
10.3390/info10110341
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-Robot Confrontation on physics-based simulators is a complex and time-consuming task, but simulators are required to evaluate the performance of the advanced algorithms. Recently, a few advanced algorithms have been able to produce considerably complex levels in the context of the robot confrontation system when the agents are facing multiple opponents. Meanwhile, the current confrontation decision-making system suffers from difficulties in optimization and generalization. In this paper, a fuzzy reinforcement learning (RL) and the curriculum transfer learning are applied to the micromanagement for robot confrontation system. Firstly, an improved Q-learning in the semi-Markov decision-making process is designed to train the agent and an efficient RL model is defined to avoid the curse of dimensionality. Secondly, a multi-agent RL algorithm with parameter sharing is proposed to train the agents. We use a neural network with adaptive momentum acceleration as a function approximator to estimate the state-action function. Then, a method of fuzzy logic is used to regulate the learning rate of RL. Thirdly, a curriculum transfer learning method is used to extend the RL model to more difficult scenarios, which ensures the generalization of the decision-making system. The experimental results show that the proposed method is effective.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Research on Fuzzy Reinforcement Learning Algorithm for Agents in Grids
    Li, FuFang
    Luo, Fei
    Gao, Ying
    Qi, Deyu
    Hu, JingLin
    IITAW: 2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATIONS WORKSHOPS, 2009, : 336 - +
  • [22] A Conceptual Framework of Decentralized Learning Neural Network Control Approach for Multi-Robot Cooperation in an Object Balancing Task
    Sumroum, Nattapon Jai
    Chotiprayanakul, Pholchai
    Limnararat, Sunpasit
    2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2016, : 434 - 437
  • [23] Transfer of Robot Perception Module With Adversarial Learning
    Sui, Hongjian
    Shang, Weiwei
    Li, Xiang
    IEEE ACCESS, 2019, 7 : 79726 - 79736
  • [24] Skill based transfer learning with domain adaptation for continuous reinforcement learning domains
    Shoeleh, Farzaneh
    Asadpour, Masoud
    APPLIED INTELLIGENCE, 2020, 50 (02) : 502 - 518
  • [25] Graph based skill acquisition and transfer Learning for continuous reinforcement learning domains
    Shoeleh, Farzaneh
    Asadpour, Masoud
    PATTERN RECOGNITION LETTERS, 2017, 87 : 104 - 116
  • [26] A Probabilistic Fuzzy Controller with Operant Learning for Robot Navigation
    Gao, Yuanyuan
    Ruan, Xiaogang
    Li, Bin
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 368 - 373
  • [27] Neural network Reinforcement Learning for visual control of robot manipulators
    Miljkovic, Zoran
    Mitic, Marko
    Lazarevic, Mihailo
    Babic, Bojan
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1721 - 1736
  • [28] Skills' learning in an autonomous mobile robot using continuous reinforcement
    Boada, MJL
    Salichs, MA
    ADVANCED FUZZY-NEURAL CONTROL 2001, 2002, : 117 - 122
  • [29] Reinforcement learning for facilitating human-robot-interaction in manufacturing
    Oliff, Harley
    Liu, Ying
    Kumar, Maneesh
    Williams, Michael
    Ryan, Michael
    JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 : 326 - 340
  • [30] Biological robot arm motion-through reinforcement learning
    Izawa, J
    Kondo, T
    Ito, K
    2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 3398 - 3403