Fuzzy Reinforcement Learning and Curriculum Transfer Learning for Micromanagement in Multi-Robot Confrontation

Cited by: 4

Authors:

Hu, Chunyang [1]

Xu, Meng [2]

Affiliations:

[1] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang 441053, Peoples R China

[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China

Source:

INFORMATION | 2019 / Vol. 10 / Issue 11

Keywords:

multi-robot confrontation; fuzzy reinforcement learning; curriculum transfer learning; neural network; INTELLIGENCE; FRAMEWORK;

DOI:

10.3390/info10110341

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Multi-robot confrontation on physics-based simulators is a complex and time-consuming task, but simulators are required to evaluate the performance of advanced algorithms. Recently, a few advanced algorithms have been able to produce considerably complex levels in the context of the robot confrontation system when the agents are facing multiple opponents. Meanwhile, current confrontation decision-making systems suffer from difficulties in optimization and generalization. In this paper, fuzzy reinforcement learning (RL) and curriculum transfer learning are applied to micromanagement in a robot confrontation system. Firstly, an improved Q-learning algorithm in the semi-Markov decision process is designed to train the agent, and an efficient RL model is defined to avoid the curse of dimensionality. Secondly, a multi-agent RL algorithm with parameter sharing is proposed to train the agents. A neural network with adaptive momentum acceleration is used as a function approximator to estimate the state-action function, and a fuzzy logic method is used to regulate the learning rate of RL. Thirdly, a curriculum transfer learning method is used to extend the RL model to more difficult scenarios, which ensures the generalization of the decision-making system. The experimental results show that the proposed method is effective.
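
The abstract does not say how fuzzy logic regulates the learning rate, so the following is only a minimal illustrative sketch and not the authors' method. It assumes a tabular Q-learning update (the paper uses a neural-network approximator), three hypothetical fuzzy sets over the normalised magnitude of the temporal-difference error, and hypothetical rule outputs (0.05, 0.2, 0.5) combined by a weighted average. The names `tri`, `fuzzy_learning_rate`, and `q_update` are made up for illustration.

```python
def tri(x, a, b, c):
    """Triangular membership function with feet at a and c and peak at b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def fuzzy_learning_rate(td_error, scale=1.0):
    """Map |TD error| to a learning rate using three assumed fuzzy rules."""
    e = min(abs(td_error) / scale, 1.0)  # normalise the error to [0, 1]
    # Memberships of "small", "medium", and "large" error.
    small = max(0.0, 1.0 - 2.0 * e)
    medium = tri(e, 0.0, 0.5, 1.0)
    large = max(0.0, 2.0 * e - 1.0)
    # Assumed rule outputs: small error -> small step, large error -> large step.
    rates = (0.05, 0.2, 0.5)
    weights = (small, medium, large)
    # Weighted-average defuzzification; the weights always sum to 1 here.
    return sum(w * r for w, r in zip(weights, rates)) / sum(weights)


def q_update(q, state, action, reward, next_state, gamma=0.9):
    """One Q-learning step whose learning rate is chosen by the fuzzy rules."""
    best_next = max(q[next_state].values())
    td_error = reward + gamma * best_next - q[state][action]
    alpha = fuzzy_learning_rate(td_error)
    q[state][action] += alpha * td_error
    return alpha, td_error


# Tiny two-state, two-action table used only to exercise the update.
q_table = {"s0": {"a": 0.0, "b": 0.0}, "s1": {"a": 0.0, "b": 0.0}}
alpha, td = q_update(q_table, "s0", "a", reward=1.0, next_state="s1")
```

In this sketch a larger temporal-difference error yields a larger step, which is one plausible design choice; the paper may regulate the rate from different inputs or in the opposite direction.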


Pages: 22

50 items in total

[21] A Conceptual Framework of Decentralized Learning Neural Network Control Approach for Multi-Robot Cooperation in an Object Balancing Task [J].

Sumroum, Nattapon Jai ;

Chotiprayanakul, Pholchai ;

Limnararat, Sunpasit .

2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2016, :434-437

[22] Transfer of Robot Perception Module With Adversarial Learning [J].

Sui, Hongjian ;

Shang, Weiwei ;

Li, Xiang .

IEEE ACCESS, 2019, 7 :79726-79736

[23] Graph based skill acquisition and transfer Learning for continuous reinforcement learning domains [J].

Shoeleh, Farzaneh ;

Asadpour, Masoud .

PATTERN RECOGNITION LETTERS, 2017, 87 :104-116

[24] Skill based transfer learning with domain adaptation for continuous reinforcement learning domains [J].

Shoeleh, Farzaneh ;

Asadpour, Masoud .

APPLIED INTELLIGENCE, 2020, 50 (02) :502-518

[25] A Probabilistic Fuzzy Controller with Operant Learning for Robot Navigation [J].

Gao, Yuanyuan ;

Ruan, Xiaogang ;

Li, Bin .

PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, :368-373

[26] Intelligent Path Planning of Underwater Robot Based on Reinforcement Learning [J].

Yang, Jiachen ;

Ni, Jingfei ;

Xi, Meng ;

Wen, Jiabao ;

Li, Yang .

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) :1983-1996

[27] Neural network Reinforcement Learning for visual control of robot manipulators [J].

Miljkovic, Zoran ;

Mitic, Marko ;

Lazarevic, Mihailo ;

Babic, Bojan .

EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) :1721-1736

[28] Skills' learning in an autonomous mobile robot using continuous reinforcement [J].

Boada, MJL ;

Salichs, MA .

ADVANCED FUZZY-NEURAL CONTROL 2001, 2002, :117-122

[29] Biological robot arm motion-through reinforcement learning [J].

Izawa, J ;

Kondo, T ;

Ito, K .

2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, :3398-3403

[30] Reinforcement learning for facilitating human-robot-interaction in manufacturing [J].

Oliff, Harley ;

Liu, Ying ;

Kumar, Maneesh ;

Williams, Michael ;

Ryan, Michael .

JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 :326-340
