Gradual Acquisition of Feed-Forward Control in Repetitive Motions by State-Independent Reinforcement Learning

Cited by: 0
Authors
Mamiya, Haruki [1]
Kobayashi, Yuichi [1]
Affiliations
[1] Shizuoka Univ, Grad Sch Sci & Technol, Dept Engn, Chuo Ku, 3-5-1 Johoku, Hamamatsu, Shizuoka, Japan
Source
2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM 2024 | 2024
DOI
10.1109/AIM55361.2024.10637084
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Human motor control is characterized by its adaptability to new dynamics. As a result of adaptation, humans can achieve motor control with less computational effort while still accomplishing the task. In this paper, we hypothesize that such adaptation can be modeled as the process of acquiring a feed-forward control sequence. Based on this hypothesis, we propose a state-independent reinforcement learning model of feed-forward control generation and feedback control reduction. A gradual learning strategy is presented, based on state-independent and time-dependent reinforcement learning, to improve learning efficiency for repetitive tracking control tasks. The proposed motor learning model was validated in a simulation of a 2-DOF manipulator tracking control task, where the robot could obtain a state-unaware control sequence under unknown dynamics and external force conditions.
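The idea of a time-indexed, state-independent feed-forward sequence gradually taking over from feedback in a repeated tracking task can be pictured with the sketch below. It is not the reinforcement learning algorithm of the paper: it swaps in a simple feedback-error-learning style update (add a fraction of the previous repetition's feedback command to the feed-forward sequence) and uses a hypothetical 1-D point mass instead of the 2-DOF manipulator. All constants, gains, and function names are assumptions chosen for illustration only.

```python
import math

DT = 0.02                                   # integration step [s] (assumed)
STEPS = 100                                 # control steps per repetition (assumed)
MASS, DAMPING, EXT_FORCE = 1.0, 0.5, -2.0   # plant parameters, never read by the learner
KP, KD = 100.0, 20.0                        # fixed PD feedback gains (assumed)
LEARN_RATE = 0.5                            # fraction of feedback moved into feed-forward

def make_reference():
    """Periodic position reference and its backward-difference velocity."""
    pos = [0.25 * (1.0 - math.cos(2.0 * math.pi * t / STEPS)) for t in range(STEPS + 1)]
    vel = [0.0] + [(pos[t] - pos[t - 1]) / DT for t in range(1, STEPS + 1)]
    return pos, vel

def run_trial(ff, ref_pos, ref_vel, use_feedback=True):
    """Simulate one repetition; return (feedback commands, RMS position error)."""
    x, v = 0.0, 0.0
    fb_log, sq_err = [], 0.0
    for t in range(STEPS + 1):
        fb = KP * (ref_pos[t] - x) + KD * (ref_vel[t] - v)
        fb_log.append(fb)
        sq_err += (ref_pos[t] - x) ** 2
        if t == STEPS:
            break
        # The feed-forward term depends only on the time index t, not on the state.
        u = ff[t] + (fb if use_feedback else 0.0)
        acc = (u - DAMPING * v + EXT_FORCE) / MASS
        v += acc * DT                       # semi-implicit Euler step
        x += v * DT
    return fb_log, math.sqrt(sq_err / (STEPS + 1))

def learn_feedforward(trials=60):
    """Repeat the task, shifting feedback effort into the feed-forward sequence."""
    ref_pos, ref_vel = make_reference()
    ff = [0.0] * STEPS
    efforts = []                            # mean |feedback| per repetition
    for _ in range(trials):
        fb_log, _ = run_trial(ff, ref_pos, ref_vel)
        efforts.append(sum(abs(f) for f in fb_log) / len(fb_log))
        # The command at step t first shows up in the state at step t + 1,
        # so ff[t] is corrected with the feedback observed one step later.
        ff = [ff[t] + LEARN_RATE * fb_log[t + 1] for t in range(STEPS)]
    return ff, efforts

if __name__ == "__main__":
    ff, efforts = learn_feedforward()
    ref_pos, ref_vel = make_reference()
    _, rms_open_loop = run_trial(ff, ref_pos, ref_vel, use_feedback=False)
    print("mean |feedback| first/last trial:", efforts[0], efforts[-1])
    print("RMS error replaying feed-forward only:", rms_open_loop)
```

Running the script prints the mean feedback effort in the first and last repetition and the tracking error when the learned sequence is replayed with feedback switched off. In this toy setting the feedback effort shrinks as the sequence absorbs the unknown damping and external force, which is the qualitative behavior the abstract describes; the learning rule itself is a stand-in, not the authors' method.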
Pages: 192-197
Number of pages: 6