Relationship Between the Order for Motor Skill Transfer and Motion Complexity in Reinforcement Learning

被引:6
作者
Cho, Nam Jun [1 ]
Lee, Sang Hyoung [2 ]
Suh, Il Hong [1 ]
Kim, Hong-Seok [2 ]
机构
[1] Hanyang Univ, Dept Elect & Comp Engn, Seoul 04763, South Korea
[2] Korea Inst Ind Technol, Smart Res Grp, Cheonan 31056, South Korea
关键词
Motor skill transfer; ordering; motion complexity; reinforcement learning; robot manipulation; IMITATION; ROBOTICS;
D O I
10.1109/LRA.2018.2889026
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We propose a method to generate an order for learning and transferring motor skills based on motion complexity, then evaluate the order to learn motor skills of a task and transfer them to another task as a form of reinforcement learning (RL). Here, motion complexity refers to the complexity calculated from multiple motion trajectories of a task. To do this, multiple human demonstrations are extracted and clustered to calculate motion complexity and identify the motor skills involved in a task. The motion trajectories of the task are then used to calculate the motion complexity considering temporal entropy and spatial entropy. Finally, both orders [Simple-to-Complex] and [Complex-to-Simple] are generated to learn and transfer motor skills based on the motion complexities of multiple tasks. To evaluate these orders, two tasks [Drawing] and [Fitting] are performed using an actual robotic arm. To verify the learning and transfer processes, we apply our method to three different figures as well as to pegs and holes of three different shapes and analyze the experimental results. In addition, we provide guidelines for using the [Simple-to-Complex] and [Complex-to-Simple] orders in RL.
引用
收藏
页码:293 / 300
页数:8
相关论文
共 17 条
[1]  
An SI, 2015, IEEE INT CONF ROBOT, P1423, DOI 10.1109/ICRA.2015.7139376
[2]   Purposive behavior acquisition for a real robot by vision-based reinforcement learning [J].
Asada, M ;
Noda, S ;
Tawaratsumida, S ;
Hosoda, K .
MACHINE LEARNING, 1996, 23 (2-3) :279-303
[3]   Training in robotics: The learning curve and contemporary concepts in training [J].
Bach, Christian ;
Miernik, Arkadiusz ;
Schoenthaler, Martin .
ARAB JOURNAL OF UROLOGY, 2014, 12 (01) :58-61
[4]   The effects of task difficulty, novelty and the size of the search space on intrinsically motivated exploration [J].
Baranes, Adrien F. ;
Oudeyer, Pierre-Yves ;
Gottlieb, Jacqueline .
FRONTIERS IN NEUROSCIENCE, 2014, 8
[5]   iLab-20M: A large-scale controlled object dataset to investigate deep learning [J].
Borji, Ali ;
Izadi, Saeed ;
Itti, Laurent .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2221-2230
[6]   Learning and Reproduction of Gestures by Imitation An Approach Based on Hidden Markov Model and Gaussian Mixture Regression [J].
Calinon, Sylvain ;
D'Halluin, Florent ;
Sauser, Eric L. ;
Caldwell, Darwin G. ;
Billard, Aude G. .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2010, 17 (02) :44-54
[7]  
Fichtl S., 2012, PALADYN, V3, P188
[8]   General surgery training and robotics: Are residents improving their skills? [J].
Finnerty, Brendan M. ;
Afaneh, Cheguevara ;
Aronova, Anna ;
Fahey, Thomas J., III ;
Zarnegar, Rasa .
SURGICAL ENDOSCOPY AND OTHER INTERVENTIONAL TECHNIQUES, 2016, 30 (02) :567-573
[9]   Complex Skill Training Transfers to Improved Performance and Control of Simpler Tasks After Stroke [J].
Kantak, Shailesh S. ;
Zahedi, Nazaneen ;
McGrath, Robert .
PHYSICAL THERAPY, 2017, 97 (07) :718-728
[10]  
Kober J, 2010, IEEE ROBOT AUTOM MAG, V17, P55, DOI 10.1109/MRA.2010.936952