Model-Free Robust Optimal Feedback Mechanisms of Biological Motor Control

Cited by: 24
Authors
Bian, Tao [1]
Wolpert, Daniel M. [2, 3]
Jiang, Zhong-Ping [1]
Affiliations
[1] NYU, Control & Networks Lab, Dept Elect & Comp Engn, Tandon Sch Engn, Brooklyn, NY 11201 USA
[2] Columbia Univ, Dept Neurosci, Zuckerman Mind Brain Behav Inst, New York, NY 10027 USA
[3] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Funding
Wellcome Trust (UK); National Science Foundation (USA)
Keywords
ADAPTIVE OPTIMAL-CONTROL; CONTINUOUS-TIME; ARM MOVEMENTS; ADAPTATION; VARIABILITY; SYSTEMS; STABILITY; MEMORY; REWARD; SIGNAL;
DOI
10.1162/neco_a_01260
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sensorimotor tasks that humans perform are often affected by different sources of uncertainty. Nevertheless, the central nervous system (CNS) can gracefully coordinate our movements. Most learning frameworks rely on the internal model principle, which requires a precise internal representation in the CNS to predict the outcomes of our motor commands. However, learning a perfect internal model in a complex environment over a short period of time is a nontrivial problem. Indeed, achieving proficient motor skills may require years of training for some difficult tasks. Internal models alone may not be adequate to explain the motor adaptation behavior during the early phase of learning. Recent studies investigating the active regulation of motor variability, the presence of suboptimal inference, and model-free learning have challenged some of the traditional viewpoints on the sensorimotor learning mechanism. As a result, it may be necessary to develop a computational framework that can account for these new phenomena. Here, we develop a novel theory of motor learning, based on model-free adaptive optimal control, which can bypass some of the difficulties in existing theories. This new theory is based on our recently developed adaptive dynamic programming (ADP) and robust ADP (RADP) methods and is especially useful for accounting for motor learning behavior when an internal model is inaccurate or unavailable. Our preliminary computational results are in line with experimental observations reported in the literature and can account for some phenomena that are inexplicable using existing models.
Pages: 562 - 595
Number of pages: 34
Related papers
50 in total
  • [21] Neural-network-based robust optimal control of uncertain nonlinear systems using model-free policy iteration algorithm
    Li, Chao
    Wang, Ding
    Liu, Derong
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4545 - 4550
  • [22] A robust inventory management in dynamic supply chains using an adaptive model-free control
    Nya, Danielle Nyakam
    Abouaissa, Hassane
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 179
  • [23] Practical model-free robust estimation and control design for an underwater soft IPMC actuator
    Khawwaf, Jasim
    Zheng, Jinchuan
    Wang, Hai
    Man, Zhihong
    IET CONTROL THEORY AND APPLICATIONS, 2020, 14 (11) : 1508 - 1515
  • [24] Conflict and competition between model-based and model-free control
    Lei, Yuqing
    Solway, Alec
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (05)
  • [25] Model-Free Predictive Current Control of Synchronous Reluctance Motor Drives for Pump Applications
    De Martin, Ismaele Diego
    Pasqualotto, Dario
    Tinazzi, Fabio
    Zigliotto, Mauro
    MACHINES, 2021, 9 (10)
  • [26] Model-free adaptive robust control based on TDE for robot with disturbance and input saturation
    Liu, Xia
    Wang, Lu
    Yang, Yong
    ROBOTICA, 2023, 41 (11) : 3426 - 3445
  • [27] Model-free cable robot control
    Blanchini, Franco
    Della Schiava, Luca
    Fenu, Gianfranco
    Giordano, Giulia
    Pellegrino, Felice Andrea
    Salvato, Erica
    IFAC PAPERSONLINE, 2023, 56 (02) : 550 - 555
  • [28] Movement Duration, Fitts's Law, and an Infinite-Horizon Optimal Feedback Control Model for Biological Motor Systems
    Qian, Ning
    Jiang, Yu
    Jiang, Zhong-Ping
    Mazzoni, Pietro
    NEURAL COMPUTATION, 2013, 25 (03) : 697 - 724
  • [29] Robust Control in Human Reaching Movements: A Model-Free Strategy to Compensate for Unpredictable Disturbances
    Crevecoeur, Frederic
    Scott, Stephen H.
    Cluff, Tyler
    JOURNAL OF NEUROSCIENCE, 2019, 39 (41) : 8135 - 8148
  • [30] Safe Model-Free Optimal Voltage Control via Continuous-Time Zeroth-Order Methods
    Chen, Xin
    Poveda, Jorge I.
    Li, N.
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 4064 - 4070