Learning through Imitation and Reinforcement Learning: Toward the Acquisition of Painting Motions

Cited by: 2
Authors
Sakato, Tatsuya [1]
Ozeki, Motoyuki [1]
Oka, Natsuki [1]
Affiliations
[1] Kyoto Inst Technol, Grad Sch Sci & Technol, Kyoto 606, Japan
Source
2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014) | 2014
Keywords
imitation; autonomous agent; adaptation;
DOI
10.1109/IIAI-AAI.2014.174
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
Learning is essential for an autonomous agent to adapt to an environment. One method of learning is through trial and error; however, this method is impractical in a complex environment because of the long learning time required by the agent. Therefore, guidelines are necessary in order to expedite the learning process in such environments, and imitation is one such guideline. Sakato, Ozeki, and Oka (2012-2013) recently proposed a computational model of imitation and autonomous behavior by which an agent can reduce its learning time through imitation. They evaluated the model in discrete and continuous spaces and applied it to a real robot in order to acquire painting skills. Their experimental results indicate that the model adapted to the experimental environment by imitation. In this paper, we introduce the model and discuss what is needed to improve it.
Pages: 873-880
Page count: 8
Related papers
12 in total
[1] Imitation with ALICE: Learning to imitate corresponding actions across dissimilar embodiments [J]. Alissandrakis, A; Nehaniv, CL; Dautenhahn, K. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2002, 32 (04): 482-496
[2] Discovering optimal imitation strategies [J]. Billard, A; Epars, Y; Calinon, S; Schaal, S; Cheng, G. ROBOTICS AND AUTONOMOUS SYSTEMS, 2004, 47 (2-3): 69-77
[3] Kuniyoshi Y., 2007, J ROBOTICS SOC JAPAN, V25, P671
[4] Nehaniv CL, 2000, WSS ROB INTELL SYST, V24, P136
[5] Ng AY, 1999, MACHINE LEARNING, PROCEEDINGS, P278
[6] Price B, 1999, MACHINE LEARNING, PROCEEDINGS, P325
[7] Sakato T., 2013, STUDIES COMPUTATIONA, V492, P37
[8] Sakato T., 2012, SOFTW ENG ART INT NE, V8, P13
[9] Learning Which Features to Imitate in a Painting Task [J]. Sakato, Tatsuya; Ozeki, Motoyuki; Oka, Natsuki. 2013 SECOND IIAI INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2013), 2013: 379-384
[10] Sutton R.S., 2017, Introduction to reinforcement learning