How an adaptive learning rate benefits neuro-fuzzy reinforcement learning systems

被引:2
作者
Kuremoto, Takashi [1 ]
Obayashi, Masanao [1 ]
Kobayashi, Kunikazu [2 ]
Mabu, Shingo [1 ]
机构
[1] Graduate School of Science and Engineering, Yamaguchi University Tokiwadai 2-16-1, Ube,Yamaguchi
[2] Schoo of Information Science and Technology, Aichi Prefectural University Ibaragabasama 1522-3, Nagakute-Shi,Aichi
来源
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2014年 / 8794卷
关键词
Adaptive learning rate (ALR); Goal-exploration problem; Multi-agent system (MAS); Neuro-fuzzy system; Reinforcement learning (RL); Swarm behavior;
D O I
10.1007/978-3-319-11857-4_37
中图分类号
学科分类号
摘要
To acquire adaptive behaviors of multiple agents in the unknown environment, several neuro-fuzzy reinforcement learning systems (NFRLSs) have been proposed Kuremoto et al. Meanwhile, to manage the balance between exploration and exploitation in fuzzy reinforcement learning (FRL), an adaptive learning rate (ALR), which adjusting learning rate by considering “fuzzy visit value” of the current state, was proposed by Derhami et al. recently. In this paper, we intend to show how the ALR accelerates some NFRLSs which are reinforcement learning systems with a self-organizing fuzzy neural network (SOFNN) and different learning methods including actor-critic learning (ACL), and Sarsa learning (SL). Simulation results of goal-exploration problems showed the powerful effect of the ALR comparing with the conventional empirical fixed learning rates. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:324 / 331
页数:7
相关论文
共 8 条
[1]  
Kaelbling L.P., Littman M.L., Reinforcement Learning: A Survey, J. Artificial Intelligence Research, 4, pp. 237-285, (1996)
[2]  
Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, (1998)
[3]  
Samejima K., Omori T., Adaptive Internal State Space Construction Method for Reinforcement Learning of a Real-World Agent, Neural Networks, 12, pp. 1143-1155, (1999)
[4]  
Wang X.S., Cheng Y.H., Yi J.Q., A fuzzy Actor–Critic Reinforcement Learning Network, Information Sciences, 177, pp. 3764-3781, (2007)
[5]  
Kuremoto T., Obayashi M., Kobayashi K., Adaptive Swarm Behavior Acquisition by a Neuro-Fuzzy System and Reinforcement Learning Algorithm, Intern. J. Of Intelligent Computing and Cybernetics, 2, 4, pp. 724-744, (2009)
[6]  
Kuremoto T., Yamano Y., Obayashi M., Kobayashi K., An Improved Internal Model for Swarm Formation and Adaptive Swarm Behavior Acquisition, J. Of Circuits, Systems, and Computers, 18, 8, pp. 1517-1531, (2009)
[7]  
Kuremoto T., Yamano Y., Feng L.-B., Kobayashi K., Obayashi M., A Fuzzy Neural Network with Reinforcement Learning Algorithm for Swarm Learning, Future Computing, Communication, Control and Management. LNEE, 144, pp. 101-108, (2011)
[8]  
Derhami V., Majd V.J., Ahmadabadi M.N., Exploration and Exploitation Balance Management in Fuzzy Reinforcement Learning, Fuzzy Sets and Systems, 161, 4, pp. 578-595, (2010)