Gradual Acquisition of Feed-Forward Control in Repetitive Motions by State-Independent Reinforcement Learning

被引:0
作者
Mamiya, Haruki [1 ]
Kobayashi, Yuichi [1 ]
机构
[1] Shizuoka Univ, Grad Sch Sci & Technol, Dept Engn, Chuo Ku, 3-5-1 Johoku, Hamamatsu, Shizuoka, Japan
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM 2024 | 2024年
关键词
D O I
10.1109/AIM55361.2024.10637084
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human motor control is characterized by its adaptability to new dynamics. As a result of adaptation, humans can achieve motor control with less computational effort while maintaining achievement of the task. In this paper, we hypothesize that such adaptation can be modeled by acquisition process of a feed-forward control sequence. Based on the hypothesis, we propose a state-independent reinforcement learning model of feed-forward control generation and feedback control reduction. A gradual learning strategy is presented on the basis of state-independent and time-dependent reinforcement learning to improve learning efficiency for repetitive tracking control tasks. The proposed motor learning model was validated in simulation of 2-DOF manipulator tracking control task, where the robot could obtain a state-unaware control sequence under unknown dynamics and external force condition.
引用
收藏
页码:192 / 197
页数:6
相关论文
共 21 条
[1]   Special issue on hyper-adaptability for overcoming body-brain dysfunction PREFACE [J].
An, Qi ;
Hayashi, Yoshikatsu ;
Santello, Marco ;
Ota, Jun .
ADVANCED ROBOTICS, 2021, 35 (13-14) :801-801
[2]   Control Architecture for Human-Like Motion With Applications to Articulated Soft Robots [J].
Angelini, Franco ;
Della Santina, Cosimo ;
Garabini, Manolo ;
Bianchi, Matteo ;
Bicchi, Antonio .
FRONTIERS IN ROBOTICS AND AI, 2020, 7
[3]   Motor Synergy Development in High-Performing Deep Reinforcement Learning Algorithms [J].
Chai, Jiazheng ;
Hayashibe, Mitsuhiro .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1271-1278
[4]   THEORY OF THE HUMAN OPERATOR IN CONTROL SYSTEMS I. THE OPERATOR AS AN ENGINEERING SYSTEM [J].
Craik, Kenneth J. W. .
BRITISH JOURNAL OF PSYCHOLOGY-GENERAL SECTION, 1947, 38 :56-61
[5]  
Gawthrop P. J., 2004, IFAC Proceedings, V37, P121
[6]   Predictive pole-placement control with linear models [J].
Gawthrop, PJ ;
Ronco, E .
AUTOMATICA, 2002, 38 (03) :421-432
[7]   Dynamic primitives in the control of locomotion [J].
Hogan, Neville ;
Sternad, Dagmar .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2013, 7
[8]   Learning agile and dynamic motor skills for legged robots [J].
Hwangbo, Jemin ;
Lee, Joonho ;
Dosovitskiy, Alexey ;
Bellicoso, Dario ;
Tsounis, Vassilios ;
Koltun, Vladlen ;
Hutter, Marco .
SCIENCE ROBOTICS, 2019, 4 (26)
[9]   An optimal control strategy for hybrid actuator systems: Application to an artificial muscle with electric motor assist [J].
Ishihara, Koji ;
Morimoto, Jun .
NEURAL NETWORKS, 2018, 99 :92-100
[10]  
Kambara H, 2004, P ANN INT IEEE EMBS, V26, P486